Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

nvidia-ctk returns undefined symbol: nvmlComputeInstanceDestroy #557

Open
davidshen84 opened this issue Jun 23, 2024 · 1 comment · Fixed by #732
Open

nvidia-ctk returns undefined symbol: nvmlComputeInstanceDestroy #557

davidshen84 opened this issue Jun 23, 2024 · 1 comment · Fixed by #732
Assignees
Labels
bug Issue/PR to expose/discuss/fix a bug

Comments

@davidshen84
Copy link

Hi,

I build the source code using the Makefile without any changes. I pulled the 1.15.0 tag. The build script was executed successfully and outputted all the binary files.

But when I try to create the config file using nvidia-ctk, I get the following error.

> sudo nvidia-ctk --quiet config --config-file=/etc/nvidia-container-runtime/config.toml --in-place
nvidia-ctk: symbol lookup error: nvidia-ctk: undefined symbol: nvmlComputeInstanceDestroy

Here's the nvidia-smi output.

Sun Jun 23 18:34:38 2024
+-----------------------------------------------------------------------------------------+
| NVIDIA-SMI 550.90.07              Driver Version: 550.90.07      CUDA Version: 12.4     |
|-----------------------------------------+------------------------+----------------------+
| GPU  Name                 Persistence-M | Bus-Id          Disp.A | Volatile Uncorr. ECC |
| Fan  Temp   Perf          Pwr:Usage/Cap |           Memory-Usage | GPU-Util  Compute M. |
|                                         |                        |               MIG M. |
|=========================================+========================+======================|
|   0  NVIDIA GeForce GTX 1050        Off |   00000000:01:00.0 Off |                  N/A |
| N/A   39C    P8             N/A / ERR!  |       0MiB /   4096MiB |      0%      Default |
|                                         |                        |                  N/A |
+-----------------------------------------+------------------------+----------------------+

+-----------------------------------------------------------------------------------------+
| Processes:                                                                              |
|  GPU   GI   CI        PID   Type   Process name                              GPU Memory |
|        ID   ID                                                               Usage      |
|=========================================================================================|
|  No running processes found                                                             |
+-----------------------------------------------------------------------------------------+

I know the nvidia driver version is a bit old. The latest is 555, but on Gentoo, the latest is still in 550 range.

I found this #49, which mentioned a similar error but with a different undefined symbol. I wonder if they are related.

@elezar
Copy link
Member

elezar commented Oct 11, 2024

We have created #732 and #736 to address this.

This was linked to pull requests Oct 11, 2024
@elezar elezar self-assigned this Oct 11, 2024
@elezar elezar added the bug Issue/PR to expose/discuss/fix a bug label Oct 11, 2024
@elezar elezar removed a link to a pull request Oct 11, 2024
@elezar elezar linked a pull request Oct 11, 2024 that will close this issue
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Issue/PR to expose/discuss/fix a bug
Projects
None yet
2 participants