Update: SparseGPT recipes #1142

rahul-tuli · 2025-02-12T18:33:57Z

Issue

The following test was failing:

FAILED tests/e2e/vLLM/test_vllm.py::TestvLLM_0_tests_e2e_vLLM_configs_sparse2of4_fp8_dynamic_yaml::test_vllm  
ValueError: There is no module or parameter named 'lm_head.bitmask' in LlamaForCausalLM

This issue arose from recent changes to SparseGPTModifier that altered its default behavior. Previously, lm_head was silently ignored; the modifier no longer excludes it automatically.

Fix

The fix updates the affected recipes so that, whenever all layers are targeted, they explicitly include the parameter:

ignore: ["re:.*lm_head"]

This excludes lm_head from sparsification and prevents the failure.
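To illustrate what the ignore entry selects, here is a minimal sketch. It assumes that an `re:` prefix marks the rest of the string as a regular expression matched from the start of the module name; `name_matches` and `is_ignored` are hypothetical helpers for illustration, not llm-compressor functions.

```python
import re

IGNORE = ["re:.*lm_head"]


def name_matches(name: str, pattern: str) -> bool:
    """Hypothetical helper: 're:' patterns are regexes, anything else is an exact name."""
    if pattern.startswith("re:"):
        # re.match anchors only at the start of the string (assumed semantics).
        return re.match(pattern[3:], name) is not None
    return name == pattern


def is_ignored(name: str) -> bool:
    """True when any ignore pattern matches the module name."""
    return any(name_matches(name, pattern) for pattern in IGNORE)


for module_name in ["lm_head", "model.lm_head", "model.layers.0.mlp.down_proj"]:
    print(module_name, is_ignored(module_name))
```

The leading `.*` lets the pattern match lm_head whether or not the name carries a parent prefix.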

Example Change

Previously, we relied on regex patterns to target linear layers while ignoring lm_head. The updated configuration now explicitly targets linear layers and ignores lm_head:

-    "targets": ["re:model.layers.\\d+$"],
+    "targets": ["Linear"],
+    "ignore": ["re:.*lm_head"]

Targeting by module type makes the selection explicit and confines regex matching to the ignore list.
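The effect of the change can be sketched on a toy module table. Everything here is an assumption for illustration: the module names mimic a Llama-style layout, `matches` and `resolve` are hypothetical helpers, and the real modifier may resolve targets differently (for example, by descending into matched decoder layers).

```python
import re

# Toy stand-in for a model's named modules: (module name, class name).
MODULES = [
    ("model.embed_tokens", "Embedding"),
    ("model.layers.0", "LlamaDecoderLayer"),
    ("model.layers.0.self_attn.q_proj", "Linear"),
    ("model.layers.0.mlp.down_proj", "Linear"),
    ("lm_head", "Linear"),
]


def matches(name, cls, pattern):
    """Hypothetical rule: 're:' patterns match names; others match a name or class exactly."""
    if pattern.startswith("re:"):
        return re.match(pattern[3:], name) is not None
    return pattern in (name, cls)


def resolve(targets, ignore=()):
    """Names matched by any target and by no ignore pattern."""
    return [
        name
        for name, cls in MODULES
        if any(matches(name, cls, t) for t in targets)
        and not any(matches(name, cls, i) for i in ignore)
    ]


# Old recipe: the regex selects decoder-layer blocks only, so lm_head is never matched.
old = resolve(["re:model.layers.\\d+$"])
# New target without an ignore list: every Linear module, lm_head included.
new_without_ignore = resolve(["Linear"])
# New recipe: Linear modules minus lm_head.
new = resolve(["Linear"], ignore=["re:.*lm_head"])

print(old)
print(new_without_ignore)
print(new)
```

In this toy, switching to "Linear" alone pulls lm_head into the selection, which is why the ignore entry accompanies the new target.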

Additional Fixes & Improvements

- Removed the deprecated argument sequential_update.
- Updated recipes to use "targets": ["Linear"] instead of regex matching, for clarity and maintainability.
- Raise a warning when lm_head is targeted. Contributed by @kylesayrs
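A warning of the kind mentioned in the last item could look roughly like the sketch below. This is not the code from the PR: `warn_if_lm_head_targeted` is a hypothetical function, and the rule it applies (a resolved module name whose last component is `lm_head`) is an assumption.

```python
import warnings


def warn_if_lm_head_targeted(resolved_names):
    """Hypothetical check: warn if any resolved module name ends in 'lm_head'."""
    hits = [name for name in resolved_names if name.split(".")[-1] == "lm_head"]
    if hits:
        warnings.warn(
            f"lm_head is targeted for sparsification: {hits}. "
            'Consider adding "re:.*lm_head" to `ignore`.',
            UserWarning,
        )
    return hits


# Returns the offending names so callers can act on them as well.
warn_if_lm_head_targeted(["model.layers.0.mlp.down_proj", "lm_head"])
```

Warning rather than raising keeps recipes that deliberately target lm_head usable while still flagging the likely mistake.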

To see the specific tasks where the Asana app for GitHub is being used, see below:
- https://app.asana.com/0/0/1209368043192615

github-actions · 2025-02-12T18:34:08Z

👋 Hi! Thank you for contributing to llm-compressor. Please add the ready label when the PR is ready for review.

Note: This is required to complete the testing suite, please only add the label once the PR is code complete and local testing has been performed.

dsikka · 2025-02-12T18:55:28Z

Do we know why the lm_head in the sparse_2of4_only e2e test was being skipped by the sparse24 bitmask compressor?

dsikka · 2025-02-12T19:08:24Z

Is this ready for review? still draft

rahul-tuli · 2025-02-12T19:17:09Z

> Is this ready for review? still draft

No it wasn't, now is!

tests/e2e/vLLM/recipes/Sparse_2of4/recipe_sparse_2of4.yaml

tests/llmcompressor/transformers/finetune/test_alternate_recipe.yaml

kylesayrs and others added 2 commits February 12, 2025 18:34

ignore embedding, add warning for lm_head

29e471f

Signed-off-by: Kyle Sayers <[email protected]> Signed-off-by: Rahul Tuli <[email protected]>

Update: SparseGPT recipes to latest state

e8d94b6

Signed-off-by: Rahul Tuli <[email protected]>

rahul-tuli force-pushed the update-ignores-in-sparsegpt-recipes branch from 2f08151 to e8d94b6 February 12, 2025 18:34

rahul-tuli requested review from dsikka, brian-dellabetta and horheynm February 12, 2025 18:49

rahul-tuli self-assigned this Feb 12, 2025

Fix: regex

052e6d9

Signed-off-by: Rahul Tuli <[email protected]>

rahul-tuli marked this pull request as ready for review February 12, 2025 19:17

kylesayrs previously approved these changes Feb 12, 2025

tests/e2e/vLLM/recipes/Sparse_2of4/recipe_sparse_2of4.yaml (outdated, resolved)

tests/llmcompressor/transformers/finetune/test_alternate_recipe.yaml (outdated, resolved)

Missed: sequential_updates

fc03739

Signed-off-by: Rahul Tuli <[email protected]>

rahul-tuli dismissed kylesayrs’s stale review via fc03739 February 12, 2025 19:33

kylesayrs approved these changes Feb 12, 2025

dsikka approved these changes Feb 12, 2025

dsikka merged commit 98a7ae6 into main Feb 12, 2025
7 checks passed

dsikka deleted the update-ignores-in-sparsegpt-recipes branch February 12, 2025 23:43
