Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

desi_submit_prod rechecks running jobs hundreds of times #2347

Closed
sbailey opened this issue Aug 27, 2024 · 1 comment
Closed

desi_submit_prod rechecks running jobs hundreds of times #2347

sbailey opened this issue Aug 27, 2024 · 1 comment
Assignees
Labels

Comments

@sbailey
Copy link
Contributor

sbailey commented Aug 27, 2024

desi_submit_prod starts with log messages like:

INFO:queue.py:533:get_jobs_in_queue: Querying Slurm with the following: squeue -u desi -o "%i,%P,%j,%u,%t,%M,%D,%R"
INFO:submit_prod.py:257:submit_production: night=20201214 already has a proctable, skipping.
INFO:queue.py:533:get_jobs_in_queue: Querying Slurm with the following: squeue -u desi -o "%i,%P,%j,%u,%t,%M,%D,%R"
INFO:submit_prod.py:257:submit_production: night=20201215 already has a proctable, skipping.
INFO:queue.py:533:get_jobs_in_queue: Querying Slurm with the following: squeue -u desi -o "%i,%P,%j,%u,%t,%M,%D,%R"
INFO:submit_prod.py:257:submit_production: night=20201216 already has a proctable, skipping.
INFO:queue.py:533:get_jobs_in_queue: Querying Slurm with the following: squeue -u desi -o "%i,%P,%j,%u,%t,%M,%D,%R"
INFO:submit_prod.py:257:submit_production: night=20201217 already has a proctable, skipping.
...

I.e. it appears to be invoking squeue to check the currently running jobs for every night that it is skipping while trying to find the next time to submit. This adds up to a lot of slurm load and delayed start of actual submission while trying to find a night that needs to be submitted.

Check if this squeue check could be run once, skip over N nights already submitted, and then proceed.

@sbailey
Copy link
Contributor Author

sbailey commented Aug 28, 2024

fixed by PR #2348; closing.

@sbailey sbailey closed this as completed Aug 28, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants