
[BugFix] Fix Marlin24 Bug #203

Merged
merged 1 commit into main on Nov 8, 2024
Conversation

@dsikka (Contributor) commented Nov 8, 2024

Summary

  • Fix the permutation check, which had incorrectly been running for the channel-wise case
  • Unclear how this ever worked for group quantization
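
To illustrate the kind of guard the fix describes, here is a minimal sketch (hypothetical names, not the actual compressed-tensors code): the group-wise scale permutation for the Marlin24 kernel must be skipped when quantization is channel-wise, which is commonly encoded as a group size of -1 or a group size spanning the whole row.

```python
import numpy as np

def permute_for_marlin24(scales: np.ndarray) -> np.ndarray:
    # Stand-in for the kernel-specific scale interleaving; a simple
    # column reversal substitutes for the real Marlin24 permutation here.
    return scales[:, ::-1]

def maybe_permute_scales(scales: np.ndarray, group_size: int,
                         num_cols: int) -> np.ndarray:
    # Channel-wise quantization (one scale per output channel) is commonly
    # encoded as group_size == -1 or group_size == num_cols; the group-wise
    # permutation must be skipped in that case -- the gist of this fix.
    if group_size is None or group_size == -1 or group_size == num_cols:
        return scales
    return permute_for_marlin24(scales)
```

Before the fix, the permutation branch ran unconditionally, so channel-wise scales ended up in the wrong layout.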

Generations Using Group Quantization in vLLM:

[' Paris and it is located in the north-west of the country.\nThe population of', ' the leader of the executive branch of the federal government, and thus the head of', ' Liz. I live in a small town in the middle of a prairie.']

@rahul-tuli (Member) left a comment

Whaaaaaaaaaaaaaaaaaaat 🥲

@kylesayrs (Contributor) left a comment

This should just be a syntax warning? Not sure if I see why this would raise an error

@rahul-tuli (Member) replied:

> This should just be a syntax warning? Not sure if I see why this would raise an error

This doesn't raise an error, but before this diff, group-quantized models using the marlin_24 compressor were producing 0 accuracy.
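
A toy dequantization example (NumPy only, unrelated to the actual compressor internals) shows why misaligned scales silently degrade outputs to garbage rather than raising an error: the arithmetic is still valid, just wrong.

```python
import numpy as np

rng = np.random.default_rng(0)
w = rng.normal(size=(4, 8)).astype(np.float32)

# Per-channel (per-row) symmetric int4-style quantization.
scales = np.abs(w).max(axis=1, keepdims=True) / 7
q = np.clip(np.round(w / scales), -8, 7)

good = q * scales        # scales aligned with their channels
bad = q * scales[::-1]   # scales misaligned by a spurious permutation

good_err = np.abs(good - w).mean()  # small rounding error
bad_err = np.abs(bad - w).mean()    # large error, but no exception raised
```

No exception is ever thrown, which matches the symptom described above: generations that are simply wrong, surfacing as 0 accuracy downstream.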

@rahul-tuli rahul-tuli merged commit db6ccb2 into main Nov 8, 2024
1 check passed
@rahul-tuli rahul-tuli deleted the fix_marlin24 branch November 8, 2024 03:29