Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Python Library for configure_slurm.py. #314

Open
jmchilton opened this issue Apr 29, 2017 · 6 comments
Open

Python Library for configure_slurm.py. #314

jmchilton opened this issue Apr 29, 2017 · 6 comments

Comments

@jmchilton
Copy link
Collaborator

It'd be nice if we had a uniform set of variables and such for dealing with this inside and outside of Ansible as well as inside and outside of Docker (e.g. the original place this script was developed I think was Pulsar testing years ago - https://github.com/galaxyproject/pulsar/blob/master/scripts/configure_test_slurm.py). And it'd be nice if pip install slurm_configure==<version> was used for version handling across all these projects.

@bgruening
Copy link
Owner

What about ephemeris or ansible-extras?

@jmchilton
Copy link
Collaborator Author

@bgruening I don't really like either method - ansible-galaxy-extras isn't a library that can be readily used by Pulsar testing for instance and ephemeris is should ultimately be galaxy-centric and admin-centric I would think. This script is useful outside the context of Galaxy. I get the desire to keep things simple though.

@bgruening
Copy link
Owner

Ok, makes sense. Under galaxyproject or my account - this will answer the question you or me ;)

@jmchilton
Copy link
Collaborator Author

I was thinking galaxyproject or my account - I was thinking about this as a @jmchilton issue.

@bgruening
Copy link
Owner

Go for it!

@chambm
Copy link
Contributor

chambm commented Jun 8, 2017

I noticed this issue and didn't know where else to post about configure_slurm.py, so I'll post here. On the Galaxy Jetstream image, SLURM can be pretty finicky about getting the hostname right. I've even seen it report having an old IP when the instance is redeployed, e.g. I've seen something like:

root@js-56-78:~# hostnamectl
   Static hostname: js-12-34
Transient hostname: js-56-78.jetstream-cloud.org

Should configure_slurm.py handle this kind of quirk, or does the Jetstream image itself need some additional hostname finagling?

Less Jetstream specific, can I also recommend that configure_slurm.py set up SlurmDBD to keep track of jobs between reboots? It's frustrating to have a job counted as successful because the instance crashed and Galaxy can't find the job in SLURM's history (so it assumes it completed successfully).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants