Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

GPU selection for jobs that use more than 1 gpu #4

Open
yassersouri opened this issue Jul 25, 2019 · 2 comments
Open

GPU selection for jobs that use more than 1 gpu #4

yassersouri opened this issue Jul 25, 2019 · 2 comments

Comments

@yassersouri
Copy link

For jobs that need more than 1 gpu, it is beneficial if the gpus selected are dependent on the topology of the gpu p2p connections.

This information can be seen for example from nvidia-smi topo -m.

Currently it seems that the gpus are selected randomly.

@yassersouri
Copy link
Author

yassersouri commented Jul 25, 2019

I think an intermediate solution would be if the server allocated gpus in order.
For example if gpus 0, 1, 4, 5, 6, 7 are available, it is better if the server allocates 4, 5, 6, 7.

This is because usually the p2p connections are between gpus in order. Like for example in cvg32 machine in our lab, gpus 0-3 and 4-7 are connected with good p2p connections.

@alexanderrichard
Copy link
Owner

This should be easy to do. I don't have time this weekend but might be able to look into it next weekend.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants