Skip to content

Pull requests: vllm-project/llm-compressor

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[Multi-Quant] follow-up to README and example ready When a PR is ready for review
#1855 opened Sep 22, 2025 by brian-dellabetta Loading…
2 tasks done
Small param fixes
#1854 opened Sep 22, 2025 by fynnsu Loading…
Moe calibration refactor
#1851 opened Sep 22, 2025 by sairampillai Draft
Move OBCQ to SparseGPT ready When a PR is ready for review
#1842 opened Sep 18, 2025 by Roy214 Loading…
MSE observer for NVFP4
#1840 opened Sep 17, 2025 by shubhra Loading…
ready label check ready When a PR is ready for review
#1832 opened Sep 17, 2025 by brian-dellabetta Loading…
1 task done
Consolidate example script tests into single parametrized test ready When a PR is ready for review
#1801 opened Sep 5, 2025 by fynnsu Loading…
add support for per-head attention quantization
#1791 opened Sep 2, 2025 by eldarkurtic Loading…
[QuantizationFormat] Remove code inferring format ready When a PR is ready for review
#1786 opened Aug 29, 2025 by dsikka Loading…
[MXFP4] Add mxfp4 support
#1783 opened Aug 28, 2025 by dsikka Draft
[Transform] Spinquant R3 ready When a PR is ready for review
#1778 opened Aug 27, 2025 by kylesayrs Loading…
[Tracing] Support Cohere Vision, Decouple vision tower from first layer ready When a PR is ready for review
#1710 opened Aug 6, 2025 by kylesayrs Loading…
[Example] [VLM] Gemma3n
#1696 opened Jul 31, 2025 by kylesayrs Draft
[Autowrapper] Support Gemma3n, autowrapper improvements ready When a PR is ready for review
#1693 opened Jul 30, 2025 by kylesayrs Loading…
1686 Logic matching refactor
#1687 opened Jul 28, 2025 by ved1beta Loading…
add quantization_w4a4_fp4 qwen3 example
#1681 opened Jul 24, 2025 by wangwenmingaa Loading…
[KV Cache] support kv cache int8 per channel quantization ready When a PR is ready for review
#1663 opened Jul 19, 2025 by Eviannn Loading…
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.