-
Notifications
You must be signed in to change notification settings - Fork 21
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
balsam job rm --all
fails to delete some jobs with no warning
#364
Comments
Was the site active when you tried this? |
It might have been. Does that typically prevent jobs from being deleted? |
Hm, I did some tests, and it looks like the site being active doesn't affect this after all. Are you able to delete them individually, using their job ids? Or does that fail as well? |
I ended up deleting the site as a workaround for now so I'm unable to answer that. I can post more details about what happens by doing so if I encounter this issue again. |
Hit the same bug again today, so here are the results of the approaches mentioned above: On an active site:
No, these jobs are strangely stuck and cannot be deleted without deleting the whole site. Let me know if helping the definition of the applications/jobs would help. |
Another observation I have is that the failure of job deletion seems to be associated with the daemon hanging. Seeing the above failure mode reproduced on a different login node (
where balsam thinks the site is active, but syncing it fails, indicating that the daemon is unresponsive ( hanging?):
I'm unsure if this is related to #340 |
Few questions,
|
|
Hey, I realize now that your comments on issue #340 answer some of these questions, I'll shift over my discussion there. |
On
theta
, attempting to delete all jobs from a site viabalsam job rm --all
sometimes fails with no indication that the job was not removed:The development version of balsam was installed via:
The text was updated successfully, but these errors were encountered: