You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi, I'm building a single machine testing environment, because Minikube doesn't support GPU well, so I use the local-up-cluster.sh provided at https://github.com/kubernetes/kubernetes/blob/master/hack/local-up-cluster.sh to build up single node cluster, but it not work with k8s-device-plugin well.
Do the following to reproduce it:
get source code using go get -d k8s.io/kubernetes
In order to make the local-up-cluster.sh launch kubelet with gate options, I insert the following line to the top of local-up-cluster.sh FEATURE_GATES="DevicePlugins=true"
start the cluster using sudo ./hack/local-up-cluster.sh
When running docker run -it -v /var/lib/kubelet/device-plugins:/var/lib/kubelet/device-plugins nvidia/k8s-device-plugin:1.9
I got:
2018/02/28 12:14:51 Loading NVML
2018/02/28 12:14:51 Fetching devices.
2018/02/28 12:14:51 Starting FS watcher.
2018/02/28 12:14:51 Starting OS watcher.
2018/02/28 12:14:51 Starting to serve on /var/lib/kubelet/device-plugins/nvidia.sock
2018/02/28 12:14:51 Could not register device plugin: rpc error: code = Unimplemented desc = unknown service deviceplugin.Registration
2018/02/28 12:14:51 Could not contact Kubelet, retrying. Did you enable the device plugin feature gate?
2018/02/28 12:14:51 You can check the prerequisites at: https://github.com/NVIDIA/k8s-device-plugin#prerequisites
2018/02/28 12:14:51 You can learn how to set the runtime at: https://github.com/NVIDIA/k8s-device-plugin#quick-start
2018/02/28 12:14:51 Starting to serve on /var/lib/kubelet/device-plugins/nvidia.sock
2018/02/28 12:14:51 Could not register device plugin: rpc error: code = Unimplemented desc = unknown service deviceplugin.Registration
2018/02/28 12:14:51 Could not contact Kubelet, retrying. Did you enable the device plugin feature gate?
2018/02/28 12:14:51 You can check the prerequisites at: https://github.com/NVIDIA/k8s-device-plugin#prerequisites
2018/02/28 12:14:51 You can learn how to set the runtime at: https://github.com/NVIDIA/k8s-device-plugin#quick-start
2018/02/28 12:14:51 Starting to serve on /var/lib/kubelet/device-plugins/nvidia.sock
2018/02/28 12:14:51 Could not register device plugin: rpc error: code = Unimplemented desc = unknown service deviceplugin.Registration
2018/02/28 12:14:51 Could not contact Kubelet, retrying. Did you enable the device plugin feature gate?
2018/02/28 12:14:51 You can check the prerequisites at: https://github.com/NVIDIA/k8s-device-plugin#prerequisites
2018/02/28 12:14:51 You can learn how to set the runtime at: https://github.com/NVIDIA/k8s-device-plugin#quick-start
2018/02/28 12:14:51 Starting to serve on /var/lib/kubelet/device-plugins/nvidia.sock
Here are some of my configs:
/etc/docker/daemon.json
You are using master (i.e soon to be 1.10) and the device plugin you are using is for 1.9.
Master recently broke the pluginapi before graduating to beta and we have yet to publish an implementation for 1.10 (which we'll be doing before the official release).
In the meantime you can just checkout the release-1.9 tag (git ck release-1.9) or the v1.9.3 tag
hello,i follow your step to build a single machine testing environment, all is work well , but when i use kubectl get nodes, i get no resource . how can i create a node ? Thanks !
Hi, I'm building a single machine testing environment, because Minikube doesn't support GPU well, so I use the
local-up-cluster.sh
provided athttps://github.com/kubernetes/kubernetes/blob/master/hack/local-up-cluster.sh
to build up single node cluster, but it not work withk8s-device-plugin
well.Do the following to reproduce it:
get source code using
go get -d k8s.io/kubernetes
In order to make the
local-up-cluster.sh
launchkubelet
with gate options, I insert the following line to the top oflocal-up-cluster.sh
FEATURE_GATES="DevicePlugins=true"
start the cluster using
sudo ./hack/local-up-cluster.sh
When running
docker run -it -v /var/lib/kubelet/device-plugins:/var/lib/kubelet/device-plugins nvidia/k8s-device-plugin:1.9
I got:
Here are some of my configs:
/etc/docker/daemon.json
The command line to run
kubelet
The output of
nvidia-smi
on the machine:The text was updated successfully, but these errors were encountered: