OOM error when i tried to save model in q_8 gguf #1721

Mracobes9 · 2025-02-15T20:37:20Z

Hi. I'm facing the following problem: after training a model, I get an OOM error when I try to save the result in GGUF Q8_0 (q8_0) format. Could you please advise how this can be avoided?

shimmyshimmer · 2025-02-16T03:25:30Z

You can follow this and see if it works: https://docs.unsloth.ai/basics/errors#saving-to-gguf-vllm-16bit-crashes

"You can try reducing the maximum GPU usage during saving by changing maximum_memory_usage.

The default is model.save_pretrained(..., maximum_memory_usage = 0.75). Reduce it to say 0.5 to use 50% of GPU peak memory or lower. This can reduce OOM crashes during saving."
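As a rough sketch of how that advice might be applied to a GGUF save: the helper below forwards a lower `maximum_memory_usage` to the save call. The helper name `save_gguf_low_memory`, the output directory, and the assumption that `save_pretrained_gguf` accepts the same `maximum_memory_usage` keyword as `save_pretrained` are mine, not confirmed in this thread, so check them against your installed Unsloth version.

```python
def save_gguf_low_memory(model, tokenizer, out_dir="gguf_model", fraction=0.5):
    """Save a model to GGUF q8_0 with a reduced peak-memory budget.

    `model` is assumed to be an Unsloth-patched model exposing
    `save_pretrained_gguf`; this helper itself is hypothetical.
    """
    # Guard against values that make no sense as a memory fraction.
    if not 0.0 < fraction <= 1.0:
        raise ValueError("fraction must be in the interval (0, 1]")

    # Assumption: save_pretrained_gguf takes maximum_memory_usage just like
    # the save_pretrained call quoted above (default 0.75 per the docs quote).
    model.save_pretrained_gguf(
        out_dir,
        tokenizer,
        quantization_method="q8_0",
        maximum_memory_usage=fraction,
    )
    return out_dir
```

After training you would call something like `save_gguf_low_memory(model, tokenizer, fraction=0.5)`, and try a smaller value such as 0.25 if the save still runs out of memory.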
