This repository has been archived by the owner on Nov 2, 2021. It is now read-only.
-
Notifications
You must be signed in to change notification settings - Fork 302
Issues: NVIDIA/gpu-monitoring-tools
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
dcgm exporter doesn't monitor mig disabled gpus with mixed strategy
#211
opened Sep 8, 2021 by
chloejiwon
dcgm-exporter running on "g4dn.metal" in AWS EKS fails with "fatal: morestack on gsignal"
#208
opened Aug 13, 2021 by
SQUIDwarrior
How can the k8s cluster monitor the GPU model name?
#206
opened Aug 5, 2021 by
jgsetogetifshjcdgjvxdgh
Log spam in nv-hostengine.log due to ReadNvSwitchStatusAllSwitches() returned No data is available
#194
opened Jun 9, 2021 by
jfolz
Installing the dgcm-exporter with Helm3 on OpenShift faces permissions issues
#191
opened Jun 2, 2021 by
vemonet
dcgm-exporter reports stale metrics if nvhost-engine is restarted
#188
opened May 12, 2021 by
bchess
Why duplicate metrics occured when a job scheduling to this server
#174
opened Apr 1, 2021 by
WYmindsky
dcgm-exporter unit test depends on k8s.io/kubernetes directly as a library
#172
opened Mar 30, 2021 by
shatil
Previous Next
ProTip!
Adding no:label will show everything without a label.