
After a k3s node reboot, GPU virtualization stops working and the nvidia.com/gpu count reverts to the physical GPU count; restarting the hami-device-plugin pod restores the correct nvidia.com/gpu count #829

Open
christu opened this issue Jan 23, 2025 · 3 comments
Labels
kind/bug Something isn't working

Comments

@christu

christu commented Jan 23, 2025

What happened:
After a k3s node reboot, GPU virtualization stops working and the nvidia.com/gpu count reverts to the number of physical GPUs. After restarting the hami-device-plugin pod, the nvidia.com/gpu count returns to normal.
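
For context, the advertised count can be checked from the node's allocatable resources. A minimal sketch, assuming kubectl access; node1 is a hypothetical node name:

```shell
# Show the nvidia.com/gpu capacity and allocatable as reported by the device plugin
kubectl describe node node1 | grep -i "nvidia.com/gpu"

# Or query the allocatable value directly (dots in the resource name must be escaped)
kubectl get node node1 -o jsonpath='{.status.allocatable.nvidia\.com/gpu}'
```

With HAMi working, this should show the virtualized count rather than the physical GPU count.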

What you expected to happen:

How to reproduce it (as minimally and precisely as possible):

Anything else we need to know?:

  • The output of nvidia-smi -a on your host
  • Your docker or containerd configuration file (e.g. /etc/docker/daemon.json)
  • The hami-device-plugin container logs
  • The hami-scheduler container logs
  • The kubelet logs on the node (e.g. sudo journalctl -r -u kubelet)
  • Any relevant kernel output lines from dmesg

Environment:

  • HAMi version: v2.3.13
  • nvidia driver or other AI device driver version:
  • Docker version from docker version
  • Docker command, image and tag used
  • Kernel version from uname -a
  • Others:
@christu christu added the kind/bug Something isn't working label Jan 23, 2025
@archlitchi
Collaborator

You probably didn't uninstall NVIDIA's device plugin beforehand.

@christu
Author

christu commented Feb 8, 2025

You probably didn't uninstall NVIDIA's device plugin beforehand.

I installed the NVIDIA GPU Operator first, then HAMi. After installing HAMi, do I need to uninstall the nvidia-device-plugin-daemonset?

@archlitchi
Collaborator

You probably didn't uninstall NVIDIA's device plugin beforehand.

I installed the NVIDIA GPU Operator first, then HAMi. After installing HAMi, do I need to uninstall the nvidia-device-plugin-daemonset?

Yes, unless you use a custom resource name.
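
For anyone hitting the same problem, a hedged sketch of the two options above. The chart keys devicePlugin.enabled (GPU Operator) and resourceName (HAMi) are assumptions; verify them against your chart version's values.yaml:

```shell
# Option 1: stop the GPU Operator from deploying its own device plugin,
# leaving only HAMi's plugin to advertise nvidia.com/gpu
helm upgrade gpu-operator nvidia/gpu-operator -n gpu-operator \
  --set devicePlugin.enabled=false    # key name assumed; check your chart

# Option 2: keep both plugins but give HAMi's virtualized GPUs a custom
# resource name, so the two plugins no longer overwrite each other
helm upgrade hami hami-charts/hami -n kube-system \
  --set resourceName="nvidia.com/vgpu"    # key name assumed; check your chart
```

Pods requesting HAMi vGPUs would then ask for nvidia.com/vgpu (or whatever name you chose) instead of nvidia.com/gpu.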
