[develop]: RUN_FCST failures when using Jinja-templated values in predef_grid_params.yaml
#1006
Labels
bug
Something isn't working
predef_grid_params.yaml
#1006
Expected behavior
See GitHub Discussion #1000 for full context.
Since
ush/predef_grid_params.yaml
is expecting hard-coded values for its grid parameters, not Jinja-templated YAML {{...}} entries, experiment generation should fail with an appropriate error message when grid parameters are set to Jinja-templated values (e.g.,WRTCMP_write_tasks_per_group: '{{ LAYOUT_Y }}'
). Alternatively, the code should be refactored so thatush/predef_grid_params.yaml
accepts Jinja-templated values.Current behavior
If the user sets
WRTCMP_write_tasks_per_group: '{{ LAYOUT_Y }}'
, the experiment is generated, but the value ofNNODES_RUN_FCST
cannot be properly calculated, and the experiment fails atrun_fcst
with an error message similar to the following:var_defns.sh
in the failed SRW run showsNNODES_RUN_FCST='{{ (PE_MEMBER01 + PPN_RUN_FCST - 1) // PPN_RUN_FCST }}'
Hardcoding
WRTCMP_write_tasks_per_group
allows the experiment to run.Machines affected
Probably all, but certainly Derecho. See GitHub Discussion #1000 for full context.
Steps To Reproduce
Set the grid:
After generating the experiment, the
var_defns.sh
file shows:and the test fails with
qsub: directive error: -l select={{ task_run_fcst.NNODES_RUN_FCST // 1 }}:mpiprocs=128:ncpus=128
.To correct this behavior, it is necessary to hard code
WRTCMP_write_tasks_per_group
to a particular value inush/predef_grid_params.yaml
.Detailed Description of Fix (optional)
Additional Information (optional)
Possible Implementation (optional)
Output (optional)
The text was updated successfully, but these errors were encountered: