
qwen-14b model format conversion OOM #559

Closed
frankxyy opened this issue Oct 16, 2023 · 7 comments

Comments

@frankxyy

frankxyy commented Oct 16, 2023

GPUs: four A10 cards, 24 GB of memory each

Code: latest code on the main branch

Conversion command:

python3 -m lmdeploy.serve.turbomind.deploy qwen-14b \
    /home/xuyangyang/qwen-14b-chat qwen \
    --tokenizer_path /home/xuyangyang/qwen-14b-chat/tokenizer.model \
    --tp 4 \
    --dst_path /home/xuyangyang/qwen-14b-chat_transformed_tp4

Intuitively, converting a 14B model shouldn't need that much GPU memory, should it? llama2-13b converted without any problem before.

@lvhan028
Collaborator

lvhan028 commented Oct 16, 2023

At the moment, yes. qwen-14b's vocab is much larger than llama2-13b's.
Once #296 is done, this problem will be solved.
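
(A back-of-the-envelope illustration of why the vocab size matters here; the vocab figures below are approximate assumptions, not values from this thread — check each model's config.json:)

```python
# Rough fp16 cost of the token embedding plus an untied lm_head.
# Hidden size 5120 applies to both 13B/14B models; vocab sizes are
# approximate assumptions.
hidden = 5120
bytes_fp16 = 2

for name, vocab in [("llama2-13b", 32_000), ("qwen-14b", 152_000)]:
    gib = 2 * vocab * hidden * bytes_fp16 / 1024**3  # embedding + lm_head
    print(f"{name}: ~{gib:.2f} GiB")

# llama2-13b: ~0.61 GiB
# qwen-14b:   ~2.90 GiB
```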

@frankxyy
Author

@lvhan028 Understood, thanks!

@leethu2012


Where does tokenizer.model come from? The model downloaded from huggingface doesn't have that file.

@lvhan028
Collaborator

Add --model-format qwen when converting.
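
(For illustration, the command from the original report with that flag added; paths are placeholders, and --tokenizer_path is omitted on the assumption that, as noted above, the huggingface Qwen download has no tokenizer.model:)

```
python3 -m lmdeploy.serve.turbomind.deploy qwen-14b \
    /path/to/qwen-14b-chat \
    --model-format qwen \
    --tp 4 \
    --dst_path /path/to/output
```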

@frankxyy
Author

@lvhan028 Hi, after setting model_format to qwen, it still OOMs, after layer 36.

@lvhan028
Collaborator

In deploy_qwen(), try removing the .cuda() in the get_tensor function.
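
(A sketch of the kind of change being suggested; the actual deploy_qwen() / get_tensor in lmdeploy may look different, and model_params here is a hypothetical name for the loaded checkpoint dict:)

```python
# Inside deploy_qwen(): get_tensor fetches a weight from the loaded
# checkpoint. model_params is a hypothetical name for that dict.

def get_tensor(name):
    """Return a checkpoint tensor by name."""
    # Before: .cuda() moves every weight onto the GPU, so the whole
    # model accumulates in the 24 GB of device memory and OOMs.
    # return model_params[name].cuda()

    # After: keep tensors on the CPU; splitting for --tp and saving
    # work on CPU tensors as well, just more slowly.
    return model_params[name]
```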

@frankxyy
Author

@lvhan028 That works now, thanks!
