Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Model] 1.58bits BitNet Model Support #7725

Open
wants to merge 27 commits into
base: main
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
27 commits
Select commit Hold shift + click to select a range
2be6218
Support Repack from GPTQ.
LeiWang1999 Jul 1, 2024
b92de92
chore: Remove unused input_size and output_size variables in MarlinLi…
LeiWang1999 Jul 1, 2024
71ea469
Support BitNet Model for 1.58bits.
LeiWang1999 Jul 16, 2024
dfa6b2f
Lint Fix
LeiWang1999 Jul 16, 2024
8d2c635
lint fix
LeiWang1999 Jul 16, 2024
41bb18e
Lint Fix for line length
LeiWang1999 Jul 16, 2024
29ac34d
Support Loading 1.58B Model with BitBLAS Format
LeiWang1999 Jul 17, 2024
7f69aef
Improve performance for bitnet
LeiWang1999 Jul 19, 2024
01a789a
Merge branch 'main' of https://github.com/vllm-project/vllm into bitb…
LeiWang1999 Jul 19, 2024
a973123
fix lm_head for gptq model refactor
LeiWang1999 Jul 19, 2024
aea1f4c
linx fix
LeiWang1999 Jul 19, 2024
17128d5
handle compressed scale weight.
LeiWang1999 Aug 13, 2024
1741ed4
lint fix
LeiWang1999 Aug 13, 2024
726a1f7
remove partial weight load for sw
LeiWang1999 Aug 15, 2024
68c8052
apply torch compile for uncompressed weight.
LeiWang1999 Aug 15, 2024
6eb2870
Merge branch 'main' of https://github.com/vllm-project/vllm into bitb…
LeiWang1999 Aug 15, 2024
52418ef
merge bug fix
LeiWang1999 Aug 15, 2024
a15ba12
lint fix
LeiWang1999 Aug 15, 2024
53babae
fix torch compile issue
LeiWang1999 Aug 18, 2024
40a4e53
bug fix.
LeiWang1999 Aug 20, 2024
d316a87
BENCHMARK SCRIPTS
LeiWang1999 Aug 20, 2024
4d40275
Merge branch 'main' of https://github.com/vllm-project/vllm into bitb…
LeiWang1999 Aug 20, 2024
bffc05b
Implement Test
LeiWang1999 Aug 20, 2024
8b0972b
lint fix
LeiWang1999 Aug 20, 2024
8e1a7e8
install bitblas by default to pass the doc gen.
LeiWang1999 Aug 20, 2024
7fbbccf
hide the bitblas import
LeiWang1999 Aug 20, 2024
c487e69
import fix
LeiWang1999 Aug 20, 2024
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
5 changes: 4 additions & 1 deletion .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -192,4 +192,7 @@ _build/
hip_compat.h

# Benchmark dataset
benchmarks/*.json
*.json

# Debug files
debug/
Loading
Loading