NVidia Model Testing #238
Replies: 8 comments 8 replies
-
|
@rishikanthc, it just looks like some python modules are missing. I am adding them and will see if it works. |
Beta Was this translation helpful? Give feedback.
-
|
As an aside. Large-v3 runs great. Very resource hungry though. To be expected. |
Beta Was this translation helpful? Give feedback.
-
|
Looks like we are working in the same area - I am also getting the I'm trying to fix this by using a different base image in @BarbellDwarf can you be a little more specific about how you manually installed the megatron module in the parakeet venv? Was this in the running container? |
Beta Was this translation helpful? Give feedback.
-
|
The configuration is currently initializing 4 models in the same |
Beta Was this translation helpful? Give feedback.
-
Beta Was this translation helpful? Give feedback.
-
|
I think there are capabilities of the nvidia container toolkit that are not being enabled. For example, I frequently work with an ollama server in a docker container behind an Open WebUI front end, and when I try to load something that won't fit into VRAM, such as gpt-oss:20b with a large context, the container toolkit splits the layers of the model between the GPU and the CPU, utilizing the CPU's RAM in addition to the GPU's VRAM. You can watch the model move back and forth between the CPU and the GPU using HTop and NVTop. I don't think this CPU Offloading is currently enabled by Scriberr. |
Beta Was this translation helpful? Give feedback.
-
|
Beta Was this translation helpful? Give feedback.
-
|
After adding |
Beta Was this translation helpful? Give feedback.


Uh oh!
There was an error while loading. Please reload this page.
-
So. I started testing out the new NVidia Parakeet and Canary models. And there are some issues. Here are the logs:
Canary Execution
Parakeet Execution:
it looks to be the same error for both
ModuleNotFoundError: No module named 'megatron, But wanted to get this up and ready before I begin troubleshooting.Beta Was this translation helpful? Give feedback.
All reactions