Remove QuantizationScheme.default_scheme #202

Merged 1 commit into main on Nov 22, 2024

Conversation

@kylesayrs (Contributor) commented Nov 1, 2024

Purpose

  • Remove an unused function. I believe this function was originally intended to provide defaults to QuantizationModifier without overwriting existing configs, but since these values are now always written to configs and QuantizationModifier also serves as the default for configs that do not specify values, the function is no longer necessary (a rough sketch of the pattern is included below).
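For context, here is a minimal sketch of the kind of helper being removed, assuming the usual compressed-tensors QuantizationScheme/QuantizationArgs field names; the exact defaults in the removed classmethod may have differed:

```python
# Illustrative sketch only: a helper that builds a scheme with generic 8-bit
# fallbacks. Field names follow compressed-tensors conventions; the removed
# implementation may not have matched this exactly.
from compressed_tensors.quantization import QuantizationArgs, QuantizationScheme


def default_scheme_sketch(targets: list) -> QuantizationScheme:
    """Return a scheme with fallback weight/activation settings."""
    return QuantizationScheme(
        targets=targets,
        weights=QuantizationArgs(num_bits=8, symmetric=True),
        input_activations=QuantizationArgs(num_bits=8, symmetric=False),
    )
```

With QuantizationModifier supplying these fallbacks itself, keeping a second source of defaults only risks the two drifting apart.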

Prerequisites

Changes

  • Remove QuantizationScheme.default_scheme

Testing

  • Grepped the codebases for any remaining uses

@dsikka (Contributor) left a comment


Can you verify behaviour for cases where we're only running kv_cache quantization? That is the only case I can recall where this function was relevant.
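For readers following along: a kv-cache-only run means a recipe whose QuantizationModifier carries only a kv_cache_scheme and no weight or activation schemes. The sketch below is illustrative and loosely mirrors examples/quantization_kv_cache/llama3_fp8_kv_example.py; the exact values in that script may differ.

```python
# Sketch of a kv-cache-only quantization recipe: QuantizationModifier with a
# kv_cache_scheme and no weight/input schemes. Values are illustrative.
recipe = """
quant_stage:
    quant_modifiers:
        QuantizationModifier:
            kv_cache_scheme:
                num_bits: 8
                type: float
                strategy: tensor
                dynamic: false
                symmetric: true
"""
```

The recipe is then passed to llm-compressor's oneshot entrypoint along with a calibration dataset, as in the example script.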

@kylesayrs (Contributor, Author) commented Nov 19, 2024

@dsikka Good idea. I tested a modified examples/quantization_kv_cache/llama3_fp8_kv_example.py with only kv cache quantization, end to end with vLLM:
LC = kylesayrs/quantization-modifier-defaults
CT = this branch

@kylesayrs kylesayrs self-assigned this Nov 19, 2024
@kylesayrs kylesayrs requested a review from dsikka November 19, 2024 22:41
@dsikka (Contributor) commented Nov 21, 2024

For completeness, could we validate that the example loads properly in vllm?
Alternatively, just make sure the keys/values are attached to the attention block in the state dict.
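One way to do that spot check, assuming the kv scales are serialized as k_scale/v_scale tensors on the attention modules and that the checkpoint is a single safetensors file (both the key names and the path here are assumptions):

```python
# Inspect the saved checkpoint for kv cache scale tensors on the attention
# blocks. The k_scale/v_scale key names and the path are illustrative.
from safetensors import safe_open

with safe_open("llama3-fp8-kv-only/model.safetensors", framework="pt") as f:
    kv_keys = [k for k in f.keys() if k.endswith(("k_scale", "v_scale"))]

assert kv_keys, "expected kv cache scales attached to the attention blocks"
for key in sorted(kv_keys)[:4]:
    print(key)  # e.g. model.layers.0.self_attn.k_scale
```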

@kylesayrs (Contributor, Author)

@dsikka Yep, I validated that kv cache quantization, both with and without weight & input quantization, loads and produces valid results in vLLM.
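For anyone reproducing that check, it amounts to something like the sketch below; the model path is a placeholder, and kv_cache_dtype="fp8" is the vLLM option that enables the fp8 kv cache so the stored scales are actually exercised:

```python
# Load the compressed checkpoint in vLLM and run a short greedy generation to
# confirm the kv-cache-quantized model still produces coherent output.
from vllm import LLM, SamplingParams

llm = LLM(model="llama3-fp8-kv-only", kv_cache_dtype="fp8")  # path is a placeholder
outputs = llm.generate(
    ["The capital of France is"],
    SamplingParams(temperature=0.0, max_tokens=32),
)
print(outputs[0].outputs[0].text)
```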

@horheynm (Member)

I remember hitting issues with this since it was setting a default scheme, which was not intended.
I do think we can get rid of this.

@dsikka dsikka merged commit 7103a27 into main Nov 22, 2024
1 check passed
@dsikka dsikka deleted the default-QuantizationScheme branch November 22, 2024 14:42