RuntimeError("Distributed package doesn't have NCCL " "built in") #377
-
Everything goes fine for me until I try to train C:\Users\jerme\anaconda3\envs\so-vitts-fork\python.exe: Error while finding module specification for 'main' (ValueError: main.spec is None) [W ..\torch\csrc\distributed\c10d\socket.cpp:601] [c10d] The client socket has failed to connect to [kubernetes.docker.internal]:49465 (system error: 10049 - The requested address is not valid in its context.). File "C:\Users\jerme\anaconda3\envs\so-vitts-fork\lib\site-packages\torch\distributed\distributed_c10d.py", line 998, in _new_process_group_helper Any help would be greatly appreciated, and I have no problem compensating anyone who can help me solve this issue. Thx |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment
-
I believe this is resolved |
Beta Was this translation helpful? Give feedback.
I believe this is resolved