Update SLURM builder #323

jarlsondre · 2025-02-10T10:37:17Z

Summary

Updates to the itwinai SLURM builder. See issue #288 for more information.

Brief overview:

Add SLURM Builder to CLI
Use tempdir when --no-save-script is active
Add some documentation for the builder in the existing SLURM command documentation
Add some tests for the SLURM builder
Use pytest's tmp_path instead of tempfile's TemporaryDirectory wherever possible
Remove old slurm.sh and runall.sh files from use cases
Add training command with overridable string, removing the need for separate slurm.py files in EURAC and Virgo
General cleanup, such as updating the type hints in the TorchTrainer
Possibly some more small stuff 😁
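
To illustrate the tempdir and tmp_path points above, here is a minimal sketch. It is not the actual builder code: `process_script`, its parameters, and the `submit` callback are hypothetical stand-ins. The idea is that the script is written to a temporary directory when it should not be saved, and that tests receive their scratch directory from pytest's `tmp_path` fixture instead of creating one by hand.

```python
import tempfile
from pathlib import Path
from typing import Callable


def process_script(
    script: str,
    save_script: bool,
    submit: Callable[[Path], None],
    save_dir: Path = Path("."),
    file_name: str = "job.sh",
) -> Path:
    """Write `script` to disk and pass its path to `submit` (hypothetical helper).

    If `save_script` is False, the file lives in a temporary directory
    that is deleted as soon as `submit` has returned.
    """
    if save_script:
        save_dir.mkdir(parents=True, exist_ok=True)
        path = save_dir / file_name
        path.write_text(script)
        submit(path)
        return path

    with tempfile.TemporaryDirectory() as tmp:
        path = Path(tmp) / file_name
        path.write_text(script)
        submit(path)
    # The temporary directory is gone here, so this path no longer exists.
    return path


def test_script_is_saved(tmp_path: Path) -> None:
    # pytest injects `tmp_path`, a fresh directory unique to this test.
    seen = []
    path = process_script(
        "#!/bin/bash\n", True, lambda p: seen.append(p.read_text()), save_dir=tmp_path
    )
    assert path == tmp_path / "job.sh"
    assert path.exists()
    assert seen == ["#!/bin/bash\n"]


def test_script_is_discarded() -> None:
    seen = []
    path = process_script("#!/bin/bash\n", False, lambda p: seen.append(p.read_text()))
    # The callback saw the file while it existed, but nothing is left behind.
    assert seen == ["#!/bin/bash\n"]
    assert not path.exists()
```

With `tmp_path`, pytest takes care of creating and cleaning up the directory, so the tests need no `with TemporaryDirectory()` block of their own.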

Related issues: #288, #316, #318, #319, #320, #324, #325


matbun · 2025-02-13T10:53:37Z

src/itwinai/cli.py

+        typer.Option(
+            help=("Which directory to search for the scalability metrics in.")
+        ),


our formatters will keep fighting on this forever hahaha

Haha yeah I think I have my line length a bit shorter than yours. Mine doesn't use the project-wide one I think

matbun · 2025-02-13T12:59:45Z

src/itwinai/cli.py

+@app.command(
+    context_settings={"allow_extra_args": True, "ignore_unknown_options": True}
+)
+def generate_slurm():


it would be nice to replicate arguments and options of the argument parser here as well. This would generate a nice help page, helping the users to understand what options they can use and what type they receive. You can take inspiration from exec_pipeline

This is possible and might be better since the Typer help page is pretty, but since I am using the argparser, doing -h actually works atm and will tell you all the arguments that exist. The format will be slightly different from Typer's, though.

I see two limitations with this solution. First, I guess that if you use --help instead of -h, this help page will not appear -- a bit inconsistent and may generate confusion. Second, the CLI reference is not generated in the dedicated docs page

Regarding the first limitation: This is possible to change by simply setting add_help_option=False.

The CLI reference might not be as easy to fix. We could just add them as arguments, but there are 23 of them in total. Looking at the other Annotated variables, they take between 1 and 8 lines each, where most seem to take 5-6. This would result in around 100-150 lines of variables. That is doable and might be the best fix, but it is worth thinking about first. What do you think? Should I add them? 😄

I suppose the biggest downside would be that we would have all the descriptions in two places, as well as all the variables. We could solve this by removing the descriptions from the argparser and maybe having an automatic test (in tests/) that checks that the params are the same between the argparser and the function? Just thinking out loud here.
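
A rough sketch of what such a parity test could look like, using only the standard library. The parser and the function below are stand-ins with made-up options, not the real builder arguments, and the sketch assumes that no option is required, so that parsing an empty argument list succeeds.

```python
import argparse
import inspect


def build_parser() -> argparse.ArgumentParser:
    """Stand-in for the builder's argparse parser (hypothetical options)."""
    parser = argparse.ArgumentParser()
    parser.add_argument("--job-name", type=str, default="my_job")
    parser.add_argument("--num-nodes", type=int, default=1)
    parser.add_argument("--save-script", action="store_true")
    return parser


def generate_slurm(
    job_name: str = "my_job",
    num_nodes: int = 1,
    save_script: bool = False,
) -> None:
    """Stand-in for the CLI function whose parameters mirror the parser."""


def test_parser_matches_function() -> None:
    # Parsing no arguments yields a namespace holding every option's default.
    parser_defaults = vars(build_parser().parse_args([]))
    signature = inspect.signature(generate_slurm)

    # Same parameter names on both sides ...
    assert set(parser_defaults) == set(signature.parameters)

    # ... and the same default values.
    for name, param in signature.parameters.items():
        assert parser_defaults[name] == param.default, name
```

A test like this would fail as soon as an option is added, renamed, or given a different default on only one side, which addresses the drift concern without checking the help texts themselves.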

Another option is to change the parsing mechanism to use the newest parser. That way we wouldn't have the whole double thing. The disadvantage is that it could potentially be a lot of work for something that already works the way it is.

jarlsondre added 28 commits February 10, 2025 11:35

#318 add slurm builder to cli

104a5df

#319 #320 — add tempdir, print and some cleanup

19bacf9

change retain_file to save_script

28dec9c

#324 add tests for the slurm script builder

dde249f

#324 update tests to use pytest tmp_path fixture

0823576

#316 add documentation for the SLURM script builder

833b8ba

fix linting errors

f36170e

remove some slurm.sh files and update builder with runall etc

f01f345

add slurm config to mnist use case

8ab8f10

add default training command to builder + cleanup

d28b071

add more type hints and add caption to docs

e483706

add caption to example slurm config in docs

778e2f4

add more type hints and change some var names in test

d843554

remove mnist files

36bd6c7

update gitignore to remove mnist files in tut

21d7851

add pytorch env to pytest workflow

ea7c83d

fix tempdir issue with slurm builder test

28e4981

lint tests

f5cd512

#325 use tmp_path instead of tempdir where possible

abd2c30

fix eurac config

229d156

Merge branch 'update-slurm-builder' of github.com:interTwin-eu/itwina…

df40ca0

…i into update-slurm-builder

start implementing dynamic override for training cmd

90dfd1e

#317 update to use the slurm builder for torch use cases

881131d

fix linting errors

be630b4

update gitignore + workflows + readme

ea5e41a

small cleanup

e1ae900

Merge branch 'main' into update-slurm-builder

0d4e511

fix docs

4568ddb

jarlsondre requested review from matbun and removed request for matbun February 13, 2025 08:26

jarlsondre requested review from annaelisalappe and matbun February 13, 2025 08:26

jarlsondre self-assigned this Feb 13, 2025

jarlsondre added the documentation, enhancement, and clean up labels Feb 13, 2025

jarlsondre modified the milestones: itwinai 0.2, itwinai 0.3 Feb 13, 2025

jarlsondre marked this pull request as ready for review February 13, 2025 08:33

jarlsondre changed the title ~~[DRAFT] Update SLURM builder~~ Update SLURM builder Feb 13, 2025

matbun requested changes Feb 13, 2025


jarlsondre added 2 commits February 14, 2025 15:12

add missing default params to builder from config

3ac0c1e

update eurac settings to fix parser

af8fa38
