Fix GGUF #654

danielhanchen · 2024-06-16T04:51:34Z

No description provided.

…a string when saving to gguf (#651) * Nightly (#649) * Update llama.py * offload * Update llama.py * Update llama.py * Update llama.py * Update llama.py * Update llama.py * Update llama.py * Update llama.py * continued pretraining trainer * Update trainer.py * Update trainer.py * Update trainer.py * Update trainer.py * is_bfloat16_supported * Update __init__.py * Update README.md * Update llama.py * is_bfloat16_supported * Update __init__.py * Mistral v3 * Phi 3 medium * Update chat_templates.py * Update chat_templates.py * Phi-3 * Update save.py * Update README.md Mistral v3 to Mistral v0.3 * Untrained tokens * Update tokenizer_utils.py * Update tokenizer_utils.py * Update tokenizer_utils.py * Update tokenizer_utils.py * Update tokenizer_utils.py * Update tokenizer_utils.py * Update tokenizer_utils.py * Update tokenizer_utils.py * Update tokenizer_utils.py * Update tokenizer_utils.py * Update tokenizer_utils.py * Update tokenizer_utils.py * Update tokenizer_utils.py * Update tokenizer_utils.py * Update tokenizer_utils.py * Update tokenizer_utils.py * Update tokenizer_utils.py * Update tokenizer_utils.py * Update tokenizer_utils.py * Update llama.py * Update tokenizer_utils.py * Update tokenizer_utils.py * Update tokenizer_utils.py * Update tokenizer_utils.py * Update save.py * Update save.py * Update save.py * checkpoint * Update _utils.py * Update tokenizer_utils.py * Update tokenizer_utils.py * Update tokenizer_utils.py * Update llama.py * accelerate * Update _utils.py * Update _utils.py * Update _utils.py * Update _utils.py * Update _utils.py * Update _utils.py * Update _utils.py * Update tokenizer_utils.py * train_dataloader * Update llama.py * Update llama.py * Update llama.py * use_fast_convert * Update save.py * Update save.py * Update save.py * Update save.py * remove_special_tokens * Ollama * Update chat_templates.py * Update chat_templates.py * Update chat_templates.py * Update llama.py * Update chat_templates.py * Support bfloat16 GGUF * Update save.py * Update llama.py * fast_forward_inference * Update mapper.py * Update loader.py * Update llama.py * Update tokenizer_utils.py * info * edits * Create chat template * Fix tokenizer * Update tokenizer_utils.py * fix case where gguf saving fails due to first_conversion dtype (#630) * Support revision parameter in FastLanguageModel.from_pretrained (#629) * support `revision` parameter * match unsloth formatting of named parameters * clears any selected_adapters before calling internal_model.save_pretrained (#609) * Update __init__.py (#602) Check for incompatible modules before importing unsloth * Fixed unsloth/tokenizer_utils.py for chat training (#604) * Add GGML saving option to Unsloth for easier Ollama model creation and testing. (#345) * Add save to llama.cpp GGML to save.py. * Fix conversion command and path of convert to GGML function. * Add autosaving lora to the GGML function * Create lora save function for conversion to GGML * Test fix #2 for saving lora * Test fix #3 to save the lora adapters to convert to GGML * Remove unwated tokenizer saving for conversion to ggml and added a few print statements. * Needed tokenizer for saving, added it back, also made it more unslothy style by having positional arguments, and added a few messages. * Positional arguments didn't work out, so reverted to older version of the code, and added a few comments. * Test fix 1 for arch * Test fix 2 new Mistral error. * Test fix 3 * Revert to old version for testing. * Upload issue test fix 1 * Fix 2 uploading ggml * Positional ags added. * Temporray remove positional args * Fix upload again!!! * Add print statements and fix link * Make the calling name better * Create local saving for GGML * Add choosing directory to save local GGML. * Fix lil variable error in the save_to_custom_dir func * docs: Add LoraConfig parameters documentation (#619) * llama.cpp failing (#371) llama.cpp is failing to generate quantize versions for the trained models. Error: ```bash You might have to compile llama.cpp yourself, then run this again. You do not need to close this Python program. Run the following commands in a new terminal: You must run this in the same folder as you're saving your model. git clone https://github.com/ggerganov/llama.cpp cd llama.cpp && make clean && LLAMA_CUDA=1 make all -j Once that's done, redo the quantization. ``` But when i do clone this with recursive it works. Co-authored-by: Daniel Han <[email protected]> * fix libcuda_dirs import for triton 3.0 (#227) * fix libcuda_dirs import for triton 3.0 * Update __init__.py * Update __init__.py --------- Co-authored-by: Daniel Han <[email protected]> * Update save.py * Update __init__.py * Update fast_lora.py * Update save.py * Update save.py * Update save.py * Update loader.py * Update save.py * Update save.py * quantize now llama-quantize * Update chat_templates.py * Update loader.py * Update mapper.py * Update __init__.py * embedding size * Update qwen2.py * docs * Update README.md * Update qwen2.py * README: Fix minor typo. (#559) * README: Fix minor typo. One-character typo fix while reading. * Update README.md --------- Co-authored-by: Daniel Han <[email protected]> * Update mistral.py * Update qwen2.py * Update qwen2.py * Update qwen2.py * Update llama.py * Update llama.py * Update llama.py * Update README.md * FastMistralModel * Update mistral.py * Update mistral.py * Update mistral.py * Update mistral.py * Update mistral.py * Auto check rope scaling * Update llama.py * Update llama.py * Update llama.py * GPU support * Typo * Update gemma.py * gpu * Multiple GGUF saving * Update save.py * Update save.py * check PEFT and base * Update llama.py * Update llama.py * Update llama.py * Update llama.py * Update llama.py * Update chat_templates.py --------- Co-authored-by: Michael Han <[email protected]> Co-authored-by: Eliot Hall <[email protected]> Co-authored-by: Rickard Edén <[email protected]> Co-authored-by: XiaoYang <[email protected]> Co-authored-by: Oseltamivir <[email protected]> Co-authored-by: mahiatlinux <[email protected]> Co-authored-by: Sébastien De Greef <[email protected]> Co-authored-by: Alberto Ferrer <[email protected]> Co-authored-by: Thomas Viehmann <[email protected]> Co-authored-by: Walter Korman <[email protected]> * Fix bug in save.py with interpreting quantization_method as a string that prevents GGUF from saving * Implemented better list management and then forgot to actually call the new list variable, fixed * Check type of given quantization method and return type error if not list or string * Update save.py --------- Co-authored-by: Daniel Han <[email protected]> Co-authored-by: Michael Han <[email protected]> Co-authored-by: Eliot Hall <[email protected]> Co-authored-by: Rickard Edén <[email protected]> Co-authored-by: XiaoYang <[email protected]> Co-authored-by: Oseltamivir <[email protected]> Co-authored-by: mahiatlinux <[email protected]> Co-authored-by: Sébastien De Greef <[email protected]> Co-authored-by: Alberto Ferrer <[email protected]> Co-authored-by: Thomas Viehmann <[email protected]> Co-authored-by: Walter Korman <[email protected]>

…thod as …" (#652) This reverts commit 30605de.

…ightly

…ation_me…" (#653) This reverts commit e2b2083.

danielhanchen added 30 commits May 19, 2024 16:22

Update llama.py

7df08c4

offload

ba5b6ce

Update llama.py

a07057e

Update llama.py

4be9063

Update llama.py

3dc3d3f

Update llama.py

f1cc1e8

Update llama.py

5cb531a

Update llama.py

6bd8e60

Update llama.py

d1d57ff

continued pretraining trainer

7470f67

Update trainer.py

da9c1a6

Update trainer.py

2c68f56

Update trainer.py

217bf9d

Update trainer.py

6e85384

is_bfloat16_supported

77f9c51

Update __init__.py

c0e1d27

Update README.md

2b23b93

Update llama.py

902e23a

Merge branch 'main' into nightly

98f41ce

is_bfloat16_supported

3193cac

Update __init__.py

dfeaf4b

Mistral v3

1e84090

Merge branch 'main' into nightly

f63f32b

Phi 3 medium

57ad8e7

Update chat_templates.py

2b994b2

Update chat_templates.py

ff8171f

Phi-3

5ca8b58

Merge branch 'main' into nightly

98c2e81

Merge branch 'main' into nightly

3817660

Merge branch 'main' into nightly

f858145

danielhanchen and others added 28 commits June 14, 2024 23:38

Update mistral.py

6633d4a

Auto check rope scaling

e5bf125

Merge branch 'main' into nightly

d4f4bce

Update llama.py

341565b

Update llama.py

dd3c6b1

Update llama.py

6d1ae23

GPU support

d855ef9

Merge branch 'main' into nightly

da1fe76

Typo

6656446

Update gemma.py

9bd5fad

gpu

a3061b6

Merge branch 'main' into nightly

7e5155d

Multiple GGUF saving

513bd4d

Update save.py

fb54fbb

Update save.py

4cba3e2

Merge branch 'main' into nightly

979bb22

check PEFT and base

31811cf

Update llama.py

a0232a7

Update llama.py

c4c4ff4

Update llama.py

80e82a2

Update llama.py

f62237d

Update llama.py

7f864dc

Update chat_templates.py

4dda039

Merge branch 'main' into nightly

1ba18ac

Revert "Fix breaking bug in save.py with interpreting quantization_me…

e2b2083

…thod as …" (#652) This reverts commit 30605de.

Merge branch 'nightly' of https://github.com/unslothai/unsloth into n…

aeda849

…ightly

Revert "Revert "Fix breaking bug in save.py with interpreting quantiz…

0938ab8

…ation_me…" (#653) This reverts commit e2b2083.

danielhanchen merged commit a2ee568 into main Jun 16, 2024
1 check passed

chrehall68 mentioned this pull request Jun 17, 2024

Error while saving to GGUF model #626

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix GGUF #654

Fix GGUF #654

danielhanchen commented Jun 16, 2024

Fix GGUF #654

Fix GGUF #654

Conversation

danielhanchen commented Jun 16, 2024