Device plugin not starting and pod showing 0/1 nodes are available: 1 node(s) had untolerated taint {gpu: }. preemption: 0/1 nodes are available: 1 Preemption is not helpful for scheduling..
#421
Labels
lifecycle/stale
Denotes an issue or PR has remained open with no activity and has become stale.
Hello there, I am facing an issue while integrating an Nvidia GPU with a pod.
I have successfully set up the
nvidia-container-toolkit
and I am using the5.15.0-76-generic
Linux kernel.I have setup a
kind-control-plane
node onkubectl
.I have labeled the node with
gpu=installed
And I have also tained the node with
Because without this the plugin was complaining for the node to be not a GPU node:
So after labeling and tainting the node we have the following node description:
Note that in the above description, the node does not have
nvidia.com/gpu
listed anywhere.The description of the gpu-pod is as follows:
But now, after the labeling and taint, the daemon does not even start or rather show up in the
kubectl get pods -n kube-system
command.Trying to create it again throws the following error:
Please, can anyone help me out?
The text was updated successfully, but these errors were encountered: