LM Studio Granite GGUF model loading error #7
-
Thank you for the GGUF models on Hugging Face. They download and show the correct parameters, but sadly they fail to load in LM Studio. Getting this error:
Replies: 3 comments 1 reply
-
I see a similar issue when attempting to use Ollama; the error is slightly different from the one above.
In contrast, the one from instructlab, granite-7b-lab-GGUF, works fine.
-
Hi, currently you need to use llama.cpp from the main branch, I think.
-
I am running Ollama pre-release version 0.1.39, and that is how I got around the refact error @dprosper mentioned, but the chat response is not properly parsed. So it looks like this, though I bet it probably looks better in the proprietary app IBM developed to use their model:

I feel like I'll end up just downloading the IBM app to get my feet wet with it, though I am not a fan of this initial install experience. But if you intend to integrate the model into an app and will be doing parsing downstream of your API calls, then this should work for you.

I just had to download this version: https://github.com/ollama/ollama/releases/tag/v0.1.39 and install it on my Mac by dragging it into the Applications folder, overwriting my previous 0.1.38 version. I will probably revert, as I'd rather run the stable version: I'm not developing apps that require any pre-release functionality yet, and I'm not sure whether it has caused any side effects for the other models I've been using via the Ollama server.
-
I am unsure if Ollama is currently supported.