forked from LostRuins/koboldcpp
-
Notifications
You must be signed in to change notification settings - Fork 3
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Loading status checks…
llama : support for
falcon-mamba
architecture (ggml-org#9074)
* feat: initial support for llama.cpp * fix: lint * refactor: better refactor * Update src/llama.cpp Co-authored-by: compilade <[email protected]> * Update src/llama.cpp Co-authored-by: compilade <[email protected]> * fix: address comments * Update convert_hf_to_gguf.py Co-authored-by: compilade <[email protected]> * fix: add more cleanup and harmonization * fix: lint * Update gguf-py/gguf/gguf_writer.py Co-authored-by: compilade <[email protected]> * fix: change name * Apply suggestions from code review Co-authored-by: compilade <[email protected]> * add in operator * fix: add `dt_b_c_rms` in `llm_load_print_meta` * fix: correct printf format for bool * fix: correct print format * Update src/llama.cpp Co-authored-by: compilade <[email protected]> * llama : quantize more Mamba tensors * llama : use f16 as the fallback of fallback quant types --------- Co-authored-by: compilade <[email protected]>
- v1.84202_b4746
- v1.84200_b4726
- v1.84000_b4722
- v1.83110_b4717
- v1.83100_b4675
- v1.83020_b4675
- v1.83020_b4667
- v1.83020_b4467
- v1.83007_b4608
- v1.83005_b4569
- v1.83004_b4569
- v1.83002_b4517
- v1.83001_b4517
- v1.83000_b4517
- v1.82132_b4502
- v1.82131_b4502
- v1.82021_b4502
- v1.82020_b4491
- v1.82009_b4458
- v1.82008_b4458
- v1.82007_b4458
- v1.82006_b4458
- v1.82005_b4458
- v1.82004_b4458
- v1.82003_b4455
- v1.82002_b4450
- v1.82001_b4435
- v1.81102_b4407
- v1.81100_b4407
- v1.81001_b4407
- v1.80301_b3485
- v1.80300_b3485
- v1.80002_b4229
- v1.80001_b4229
- v1.78003_b4067
- v1.77060_b3962_rebased
- v1.77009_b3962
- v1.77008_b3962
- v1.77006_b3972
- v1.77005_b3962
- v1.77004_b3962
- v1.77002_b3934
- v1.76007_b3917
- v1.76005_b3906
- v1.76004_b3896
- v1.75201_b3826
- v1.73007_b3599-2+8+4
1 parent
f63f603
commit b40eb84
Showing
5 changed files
with
36 additions
and
24 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters