Skip to content

Actions: vllm-project/llm-compressor

PR Reminder Comment Bot

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
146 workflow runs
146 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

typo
PR Reminder Comment Bot #74: Pull request #833 opened by horheynm
October 9, 2024 18:00 12s fix-abs-path
October 9, 2024 18:00 12s
Remove SparseAutoModelForCausalLM
PR Reminder Comment Bot #73: Pull request #832 opened by horheynm
October 9, 2024 16:46 12s remove-sparseAutoModelForCausalLM
October 9, 2024 16:46 12s
Typehint nits
PR Reminder Comment Bot #70: Pull request #826 opened by kylesayrs
October 7, 2024 17:58 12s kylesayrs/fix-typehint
October 7, 2024 17:58 12s
Install compressed-tensors after llm-compressor
PR Reminder Comment Bot #69: Pull request #825 opened by dbarbuzzi
October 7, 2024 14:14 12s dbarbuzzi:reorder-ct-install
October 7, 2024 14:14 12s
Awq re implementation
PR Reminder Comment Bot #68: Pull request #824 opened by rahul-tuli
October 7, 2024 13:54 12s awq-re-implementation
October 7, 2024 13:54 12s
Set Sparse compression to save_compressed
PR Reminder Comment Bot #67: Pull request #821 opened by rahul-tuli
October 6, 2024 21:50 11s set-sparse-compression-true
October 6, 2024 21:50 11s
Fix import of ModelCompressor
PR Reminder Comment Bot #66: Pull request #776 opened by rahul-tuli
October 4, 2024 13:53 10s fix-import
October 4, 2024 13:53 10s
[WIP] Example for 2:4 sparsity with w8a8
PR Reminder Comment Bot #65: Pull request #775 opened by mgoin
October 4, 2024 00:55 12s sparse-24-w8a8-example
October 4, 2024 00:55 12s
Update workflows/actions
PR Reminder Comment Bot #64: Pull request #774 opened by dbarbuzzi
October 3, 2024 14:46 10s dbarbuzzi:update-workflow-actions
October 3, 2024 14:46 10s
update test
PR Reminder Comment Bot #63: Pull request #773 opened by dsikka
October 3, 2024 01:47 17s update_test
October 3, 2024 01:47 17s
Fix 2/4 GPTQ Model Tests
PR Reminder Comment Bot #62: Pull request #769 opened by dsikka
October 2, 2024 20:49 10s fix_gptq_oneshot
October 2, 2024 20:49 10s
KV Cache, E2E Tests
PR Reminder Comment Bot #59: Pull request #742 opened by horheynm
October 1, 2024 16:16 13s kv-cache-e2e
October 1, 2024 16:16 13s
Rename to quantization config
PR Reminder Comment Bot #58: Pull request #730 opened by kylesayrs
September 29, 2024 20:44 11s kylesayrs/rename_to_quantization_config
September 29, 2024 20:44 11s
Add AutoModelForCausalLM example
PR Reminder Comment Bot #56: Pull request #698 opened by dsikka
September 27, 2024 20:09 11s add_automodel_example
September 27, 2024 20:09 11s
Model Initialization Context
PR Reminder Comment Bot #55: Pull request #695 opened by kylesayrs
September 27, 2024 15:14 11s kylesayrs/fast-load-context
September 27, 2024 15:14 11s
Move wrapper definition
PR Reminder Comment Bot #54: Pull request #694 opened by kylesayrs
September 27, 2024 14:48 16s kylesayrs/move-hf_wrap
September 27, 2024 14:48 16s
Increase Sparsity Threshold for compressors
PR Reminder Comment Bot #52: Pull request #679 opened by rahul-tuli
September 26, 2024 12:50 14s update-sparsity-threshold
September 26, 2024 12:50 14s
Remove FP8 hack, bump transformers version
PR Reminder Comment Bot #51: Pull request #676 opened by kylesayrs
September 25, 2024 20:23 11s kylesayrs/remove-fp8-hack
September 25, 2024 20:23 11s
[Bugfix] Workaround tied tensors bug
PR Reminder Comment Bot #50: Pull request #659 opened by kylesayrs
September 25, 2024 01:12 9s kylesayrs/shared_tensors_bug_workaround
September 25, 2024 01:12 9s
switch tests from weekly to nightly
PR Reminder Comment Bot #49: Pull request #658 opened by dhuangnm
September 24, 2024 17:51 14s weekly
September 24, 2024 17:51 14s
fix default test case
PR Reminder Comment Bot #48: Pull request #193 opened by dsikka
September 20, 2024 20:13 14s fix_tests
September 20, 2024 20:13 14s
Update MoE examples
PR Reminder Comment Bot #47: Pull request #192 opened by mgoin
September 20, 2024 18:05 15s moe-fp8-update
September 20, 2024 18:05 15s