-
Notifications
You must be signed in to change notification settings - Fork 176
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[8.0] Proper handling of waiting jobs when set to be killed #7690
Conversation
I think the current indentation is correct, for eg. SUBMITTING is not allowed to go to KILLED:
WAITING is allowed to go to KILLED since #7276, so it should be moved from this list of job statuses which are not allowed to go to KILLED to that which can go to KILLED. Or, can't the possible states transitions (https://github.com/DIRACGrid/DIRAC/blob/rel-v8r0/src/DIRAC/WorkloadManagementSystem/Client/JobStatus.py) be checked instead of these hard-coded list? |
I think @iueda is correct. For checking the state transitions,
|
@fstagni we have a fundamental question before moving on with this PR or a more elaborated implementation. When you have a job in Waiting status, is it meaningful to use I ask because with the current indentation, only statuses not hard-coded are appended in the Moving the Waiting status to line 520 would add them to |
|
According to WorkloadManagementSystem/Client/JobStatus.py, the states that can go to Killed are
The current code does not kill jobs in 'WAITING' state
Do we want
If the latter (b), then we would need to keep the hard-coding. |
I would go for option "a". |
Sorry for the delay @fstagni @iueda . I am back on this issue. Now the new proposal uses Also, all jobs to be killed (after the filtering) go to |
…out sending killCommand
…killed or deleted
6e3e831
to
e9ec307
Compare
I took the freedom to push for adding a unit test. |
e9ec307
to
4cf8434
Compare
Thanks a lot! |
Sweep summary Sweep ran in https://github.com/DIRACGrid/DIRAC/actions/runs/10367347752 Failed:
|
When jobs are in status SUBMITTING, WAITING, etc, and they are set to be killed, they are not added to the list
markKilledJobList
. This pull request fix the identation whenkill
instead ofdelete
is used.BEGINRELEASENOTES
*WMS
FIX: Proper killing of jobs when not matched, running or stalled
ENDRELEASENOTES