
Error when evaluating the qlora-merged model. #427

Closed
vhientran opened this issue Nov 12, 2024 · 1 comment

@vhientran

Hi Authors,

Thanks so much for releasing the great source code. I finetuned Llama-2-7B with QLoRA successfully on a node with 8 NVIDIA A100 GPUs and saved the merged model without issue. However, when loading the merged model with vLLM, I got the error below:

model = vllm.LLM(
[rank0]:             ^^^^^^^^^
[rank0]:   File "/miniconda3/envs/open-instruct/lib/python3.11/site-packages/vllm/entrypoints/llm.py", line 177, in __init__
[rank0]:     self.llm_engine = LLMEngine.from_engine_args(
[rank0]:                       ^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "/miniconda3/envs/open-instruct/lib/python3.11/site-packages/vllm/engine/llm_engine.py", line 573, in from_engine_args
[rank0]:     engine = cls(
[rank0]:              ^^^^
[rank0]: KeyError: 'layers.11.mlp.down_proj.weight'
Loading safetensors checkpoint shards:   0% Completed | 0/3 [00:00<?, ?it/s]

It seems there is a mismatch between the checkpoint's structure and what vLLM expects when loading the model. Could you please give me some suggestions on how to fix it? Many thanks for your help!
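A `KeyError` on a plain weight name like this often means the saved checkpoint's keys carry an extra wrapper prefix (for example, PEFT's `base_model.model.`) that the loader does not expect. As a minimal, hedged sketch (the `PEFT_PREFIX` value here is an assumption about a typical PEFT-wrapped checkpoint, not necessarily what this checkpoint contains), one can inspect and normalize the key names of a state-dict-like mapping before re-saving:

```python
# Sketch: strip a suspected wrapper prefix from checkpoint key names.
# PEFT_PREFIX is an assumption; inspect your checkpoint's actual keys first.
PEFT_PREFIX = "base_model.model."

def normalize_keys(state_dict):
    """Return a copy of state_dict with the wrapper prefix stripped, if present."""
    out = {}
    for key, value in state_dict.items():
        if key.startswith(PEFT_PREFIX):
            key = key[len(PEFT_PREFIX):]
        out[key] = value
    return out

# Toy example with placeholder values instead of tensors:
keys = {
    "base_model.model.model.layers.11.mlp.down_proj.weight": 0,
    "model.embed_tokens.weight": 1,
}
print(sorted(normalize_keys(keys)))
```

If the real checkpoint's keys (e.g. listed via the `safetensors` metadata) show such a prefix, re-saving with normalized keys may let vLLM resolve `layers.11.mlp.down_proj.weight`.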

@hamishivi
Collaborator

hamishivi commented Jan 8, 2025

Hi, are your vLLM and transformers versions up to date? Sometimes version mismatches there can cause issues. Otherwise, it's hard to say without more details - do you have example commands you used for training and merging?
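To check the installed versions quickly, a small stdlib-only snippet works (no assumption about any particular version being required):

```python
# Report installed versions of the relevant packages, or note their absence.
from importlib.metadata import version, PackageNotFoundError

def get_version(pkg):
    """Return the installed version string, or None if the package is missing."""
    try:
        return version(pkg)
    except PackageNotFoundError:
        return None

for pkg in ("vllm", "transformers"):
    print(pkg, get_version(pkg) or "not installed")
```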
