-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Investigate the impact of NVIDIA MPS #307
Comments
Here is a quick performance comparison running the same jobs with and without MPS. System configuration
Native performance, without MPSRunning 10 times over 4200 events with 1 jobs, each with 16 threads, 16 streams and 1 GPUs
Running 10 times over 4200 events with 2 jobs, each with 8 threads, 8 streams and 1 GPUs
Performance with MPSRunning 10 times over 4200 events with 1 jobs, each with 16 threads, 16 streams and 1 GPUs
Running 10 times over 4200 events with 2 jobs, each with 8 threads, 8 streams and 1 GPUs
|
It looks like MPS incurs in a performance penalty around 5%. However, so far it seems to be the only way to run multiple jobs - which actually recovers part of the penalty. |
Notes: how to run with MPSStart the MPS daemonRun the server as a daemon in the background:
or as a process in the foreground:
Start an MPS serverStart a server for the current user:
Run CUDA jobsRun the CUDA jobs normally. Stop the MPS daemon
|
See also some documentation at https://patatrack.web.cern.ch/patatrack/wiki/MPS-setup/ . |
No description provided.
The text was updated successfully, but these errors were encountered: