-
Notifications
You must be signed in to change notification settings - Fork 486
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
* add_exllamav2 * style * fix doc * simplify script * style * update perplexity measure * Revert "Merge branch 'add_exllamav2' into update-benchmark-gptq" This reverts commit f2dbdc2, reversing changes made to 216213e. * Merge branch 'add_exllamav2' into update-benchmark-gptq * fix arg in llama attention * flash_attention arg * Revert "Merge branch 'add_exllamav2' into update-benchmark-gptq" This reverts commit 97a7c62. * update benchmark prefill and generate * replace by use_exllama_v2 * update benchmark arg * switch to a config_dict instead of disable_exllamav2 * Apply suggestions from code review Co-authored-by: fxmarty <[email protected]> * better tests * style * style --------- Co-authored-by: fxmarty <[email protected]>
- Loading branch information
Showing
2 changed files
with
211 additions
and
139 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.