Provide a way to access the GRES_IDX attribute for a job #116

chrissamuel · 2018-07-03T11:38:24Z

Details

Slurm Version: 17.11.7
Python Version: 3.4 (RHEL7)
Cython Version: ??
PySlurm Branch: 17.11.0
Linux Distribution: RHEL7

Issue

It would be really useful to us for our job monitoring program to be able to get the GRES_IDX information from running jobs so we can highlight the GPU the job is using for our users so they can see how much utilisation it is getting.

Our issue for this is: conradtchan/jobmon#1

This looks like GRES_IDX=gpu(IDX:0) or GRES_IDX=gpu(IDX:1) or GRES_IDX=gpu(IDX:0-1) for our systems with dual GPUs. I'm not sure what it would look like for a system with say 4 GPUs if the allocation is not contiguous.

Thanks for considering this!

All the best,
Chris

The text was updated successfully, but these errors were encountered:

giovtorres · 2018-07-03T12:30:26Z

Thanks. I was hoping this one would be easy. Here is the code that needs to get wrapped: https://github.com/SchedMD/slurm/blob/5073024350eb79c8c5a9964e800bc0ce3ab93d59/src/api/job_info.c#L706-L803.

That may take me some time.

How would you like the output of this attribute, as one string that matches the scontrol output, or a list of strings if there are more than one?

chrissamuel · 2018-07-12T00:43:36Z

Sorry for not spotting this before, I've just passed your query on to the developer!

chrissamuel · 2018-08-06T09:00:35Z

Got a reply from the developer today, he said:

Sorry for the late reply... a list of strings would be great!

Hope that helps!
Chris

giovtorres · 2018-12-03T04:28:48Z

Hi @chrissamuel. I know it's been a while, but I think I got the code wrapped to get GRES_IDX in one of the 18.08 branches I'm working on. I should be able to backport to an older branch. Are you still on Slurm 17.11.7?

chrissamuel · 2018-12-12T17:50:40Z

Hi @giovtorres!

Swinburne is on 18.08.x now, but I've since moved to the US for love and for work and am now at NERSC (still doing HPC). But I'll still see updates here and let them know.

Thanks for this!
Chris

giovtorres · 2018-12-13T01:19:05Z

The gres_idx branch should work with a later version of Cython. I'll merge it into master soon after I figure out why it is failing on older Cython versions.

chrissamuel · 2018-12-14T04:33:03Z

Thanks! Passed that back to them.

giovtorres added the Missing Attribute label Jul 3, 2018

giovtorres added a commit that referenced this issue Jul 21, 2018

Copy job_resources.h to redeclare job_resources struct (#116)

27d4273

giovtorres added a commit that referenced this issue Dec 8, 2018

Copy job_resources.h to redeclare job_resources struct (#116)

46fb656

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Provide a way to access the GRES_IDX attribute for a job #116

Provide a way to access the GRES_IDX attribute for a job #116

chrissamuel commented Jul 3, 2018

giovtorres commented Jul 3, 2018

chrissamuel commented Jul 12, 2018

chrissamuel commented Aug 6, 2018 •

edited

Loading

giovtorres commented Dec 3, 2018

chrissamuel commented Dec 12, 2018

giovtorres commented Dec 13, 2018

chrissamuel commented Dec 14, 2018

Provide a way to access the GRES_IDX attribute for a job #116

Provide a way to access the GRES_IDX attribute for a job #116

Comments

chrissamuel commented Jul 3, 2018

Details

Issue

giovtorres commented Jul 3, 2018

chrissamuel commented Jul 12, 2018

chrissamuel commented Aug 6, 2018 • edited Loading

giovtorres commented Dec 3, 2018

chrissamuel commented Dec 12, 2018

giovtorres commented Dec 13, 2018

chrissamuel commented Dec 14, 2018

chrissamuel commented Aug 6, 2018 •

edited

Loading