Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Problem loading Meta-Llama-3-70B-Instruct-Q8_0-00003-of-00003.gguf #159

Open
jmbejar opened this issue Jul 19, 2024 · 3 comments
Open

Problem loading Meta-Llama-3-70B-Instruct-Q8_0-00003-of-00003.gguf #159

jmbejar opened this issue Jul 19, 2024 · 3 comments
Assignees

Comments

@jmbejar
Copy link
Collaborator

jmbejar commented Jul 19, 2024

When attempting to load this specific model, the following error logs can be seen. This is currently breaking the app (not hard crash but the app becomes unresponsive).

[2024-07-19T20:41:16Z INFO  llama-core] Initializing the core context
[2024-07-19 17:41:16.071] [info] [WASI-NN] GGML backend: LLAMA_COMMIT 5e116e8d
[2024-07-19 17:41:16.071] [info] [WASI-NN] GGML backend: LLAMA_BUILD_NUMBER 3405
[2024-07-19 17:41:16.072] [error] [WASI-NN] llama.cpp: llama_model_load: error loading model: illegal split file: 2, model must be loaded with the first split
[2024-07-19 17:41:16.072] [error] [WASI-NN] llama.cpp: llama_load_model_from_file: failed to load model
[2024-07-19 17:41:16.072] [error] [WASI-NN] GGML backend: Error: unable to init model.
[2024-07-19T20:41:16Z ERROR llama-core] Backend Error: WASI-NN Backend Error: Caller module passed an invalid argument
Error: Operation("Backend Error: WASI-NN Backend Error: Caller module passed an invalid argument")

This is not a problem related with recent changes in the wasmedge version. The following error lines were produced in a recent version:

[INFO] Log prompts: false
[INFO] Log statistics: false
[INFO] Log all information: false
[2024-07-18 19:29:47.868] [error] [WASI-NN] GGML backend: Error: unable to init model.
Error: "Fail to load model into wasi-nn: Backend Error: WASI-NN Backend Error: Caller module passed an invalid argument"
@jmbejar
Copy link
Collaborator Author

jmbejar commented Jul 19, 2024

Assigning to myself as well because there is a failure in the frontend to handle the backend error gracefully.

@juntao
Copy link
Collaborator

juntao commented Jul 20, 2024

The 70b model file is too large for the git large file system. So, it is broken into 3 files. We need to stitch them back together into one file after downloading.

This error shows that it tries to start up the third partial file in the set without the first two.

@guofoo
Copy link
Collaborator

guofoo commented Jul 23, 2024

So is the BE handling this stitching together of the files or the FE?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

5 participants