[WIP][Quantization] Update default observer to be `MSE` #300

shanjiaz · 2025-04-15T18:56:08Z

Description
This PR addresses the following issue. Update such that the MSE is used as the default observer as opposed to MinMax.

Testing
Ran examples/quantization_w4a16 and inspected the observer. See QuantizationArgs:

weights=QuantizationArgs(num_bits=4, type='int', symmetric=True, group_size=128, strategy='group', block_structure=None, dynamic=False, actorder=None, observer='mse', observer_kwargs={})

More details:

Concern
@anmarques @eldarkurtic Wanted to reach out to confirm if this is what we wanted : )

rahul-tuli

Let's go!

brian-dellabetta

This seems like a pretty significant one-line change, that would affect pretty much every quantization pathway. Do we always want MSE over MinMax? Should we start a CHANGELOG so users are aware of internal changes like this?

src/compressed_tensors/quantization/quant_args.py

Co-authored-by: Brian Dellabetta <[email protected]>

kylesayrs · 2025-04-15T20:35:39Z

@brian-dellabetta : @dalistarh has validated that the MSE observer is better across the board (within reason). Logging changes is another topic outside of the scope of this PR I believe

kylesayrs

Since observers are only used by LLM Compressor, this change is safe to make without affecting existing checkpoints.

Just FYI for the future, this is not the case for actorder, as the checkpoints do not write the actorder explicitly, so modifying the CT checkpoint default will modify existing checkpoints. If we want to modify the actorder default, it will likely require adding an actorder field to GPTQModifier and resolving that with existing QuantizationArgs.

But side note aside, looks great!

shanjiaz · 2025-04-15T20:43:26Z

Since observers are only used by LLM Compressor, this change is safe to make without affecting existing checkpoints.

Just FYI for the future, this is not the case for actorder, as the checkpoints do not write the actorder explicitly, so modifying the CT checkpoint default will modify existing checkpoints. If we want to modify the actorder default, it will likely require adding an actorder field to GPTQModifier and resolving that with existing QuantizationArgs.

But side note aside, looks great!

Thanks Kyle! Will keep that in mind.

dsikka · 2025-04-15T21:03:42Z

This seems like a pretty significant one-line change, that would affect pretty much every quantization pathway. Do we always want MSE over MinMax? Should we start a CHANGELOG so users are aware of internal changes like this?

We should run through our lm-eval testing after this change lands. Ive already spoken to Helen about running the test

dsikka

We likely need to do two sets of validations before landing

Run through our lm-eval tests to ensure no regression
Validate timings/potential slowdowns.

We can connect on how to set this up @shanjiaz

shanjiaz · 2025-04-24T13:30:15Z

Test results available here.

brian-dellabetta

First PR! 🥳 🥇

update default observer to be mse

4b74871

rahul-tuli previously approved these changes Apr 15, 2025

View reviewed changes

brian-dellabetta reviewed Apr 15, 2025

View reviewed changes

src/compressed_tensors/quantization/quant_args.py Outdated Show resolved Hide resolved

Update src/compressed_tensors/quantization/quant_args.py

9aed941

Co-authored-by: Brian Dellabetta <[email protected]>

shanjiaz dismissed rahul-tuli’s stale review via 9aed941 April 15, 2025 20:14

kylesayrs approved these changes Apr 15, 2025

View reviewed changes

shanjiaz requested review from rahul-tuli and brian-dellabetta April 15, 2025 20:43

dsikka requested changes Apr 15, 2025

View reviewed changes

shanjiaz changed the title ~~[Quantization] Update default observer to be MSE~~ [WIP][Quantization] Update default observer to be MSE Apr 15, 2025

shanjiaz marked this pull request as draft April 15, 2025 22:43

brian-dellabetta approved these changes Apr 24, 2025

View reviewed changes

rahul-tuli approved these changes Apr 24, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP][Quantization] Update default observer to be `MSE` #300

[WIP][Quantization] Update default observer to be `MSE` #300

shanjiaz commented Apr 15, 2025 •

edited

Loading

rahul-tuli left a comment

brian-dellabetta left a comment

kylesayrs commented Apr 15, 2025

kylesayrs left a comment

shanjiaz commented Apr 15, 2025

dsikka commented Apr 15, 2025

dsikka left a comment

shanjiaz commented Apr 24, 2025

brian-dellabetta left a comment

[WIP][Quantization] Update default observer to be MSE #300

Are you sure you want to change the base?

[WIP][Quantization] Update default observer to be MSE #300

Conversation

shanjiaz commented Apr 15, 2025 • edited Loading

rahul-tuli left a comment

Choose a reason for hiding this comment

brian-dellabetta left a comment

Choose a reason for hiding this comment

kylesayrs commented Apr 15, 2025

kylesayrs left a comment

Choose a reason for hiding this comment

shanjiaz commented Apr 15, 2025

dsikka commented Apr 15, 2025

dsikka left a comment

Choose a reason for hiding this comment

shanjiaz commented Apr 24, 2025

brian-dellabetta left a comment

Choose a reason for hiding this comment

[WIP][Quantization] Update default observer to be `MSE` #300

[WIP][Quantization] Update default observer to be `MSE` #300

shanjiaz commented Apr 15, 2025 •

edited

Loading