
[Bugfix] Update expected shape for per token strategy #210

Open

kylesayrs wants to merge 2 commits into main
Conversation

@kylesayrs (Contributor) commented Nov 19, 2024

Background

  • While implementing Accelerate Utilities #193, a bug was discovered where per-token scales were being initialized with the incorrect shape

Changes

  • Change the shape of initialized quantization parameters in the per-token case (see the sketch below)
  • Unrelated: compare tuples rather than lists in tests/test_quantization/test_configs/test_strategies.py
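
For context, a minimal sketch of the intended initialization, assuming PyTorch modules and parameters (the function and attribute names below are illustrative, not the exact compressed-tensors API):

```python
import torch
from torch.nn import Module, Parameter

def init_per_token_qparams(module: Module, dtype: torch.dtype = torch.float16) -> None:
    # Per-token scales/zero points are expanded at runtime, so they are
    # initialized with shape (1, 1) rather than the previous (1, ), matching
    # the shape later computed by q_params().
    expected_shape = (1, 1)
    module.input_scale = Parameter(
        torch.empty(expected_shape, dtype=dtype), requires_grad=False
    )
    module.input_zero_point = Parameter(
        torch.zeros(expected_shape, dtype=torch.int8), requires_grad=False
    )
```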

Testing

  • Added new tests in tests/test_quantization/lifecycle/test_initialize.py

@kylesayrs self-assigned this Nov 19, 2024
@dsikka (Contributor) left a comment

Where are you seeing the failures?

The only relevant case is dynamic per-token, which shouldn't be initializing anything since the scales are determined on the fly, so I don't think this change is required.

@kylesayrs (Contributor, Author) commented

@dsikka There is a per-token non-dynamic test case in the tests.

I discovered this bug while implementing #193, which uses copy_ rather than out-of-place assignment when updating quantization parameters. Because copy_ requires the original shape (which used to be (1, )) to match the shape of the new value (which is (1, 1)), an error was thrown.

In general, the initialized shape should be the same as the shape computed by q_params(). While I agree this bug only appears in special cases and that this change isn't necessary outside of the context of #193, I believe that it is correct.
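
A minimal repro of the shape mismatch, assuming plain PyTorch tensors (the shapes are the ones discussed above; the exact error text is approximate):

```python
import torch

param = torch.zeros(1)       # scale as previously initialized: shape (1,)
update = torch.zeros(1, 1)   # scale computed by q_params(): shape (1, 1)

# Out-of-place assignment simply rebinds the name, so the shape mismatch
# goes unnoticed:
param = update

# In-place copy_ (as used in #193) requires the new value to broadcast into
# the existing parameter, so it raises here:
param = torch.zeros(1)
try:
    param.copy_(update)
except RuntimeError as err:
    print(err)  # e.g. output with shape [1] doesn't match the broadcast shape [1, 1]
```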

@dsikka (Contributor) commented Nov 21, 2024

> @dsikka There is a per-token non-dynamic test case in the tests.
>
> I discovered this bug while implementing #193, which uses copy_ rather than out-of-place assignment when updating quantization parameters. Because copy_ requires the original shape (which used to be (1, )) to match the shape of the new value (which is (1, 1)), an error was thrown.
>
> In general, the initialized shape should be the same as the shape computed by q_params(). While I agree this bug only appears in special cases and that this change isn't necessary outside of the context of #193, I believe that it is correct.

Yeah, I agree that the shape is correct.
What I'm thinking about now is when the shape is ever just (1, ). I guess in the static per-tensor case, but we actually update this in vLLM to be (1, 1) anyway (I have to confirm whether this is still the case), so we could potentially make (1, 1) the default.

@kylesayrs (Contributor, Author) commented

@dsikka From my reading, it looks like it's just (1, ): https://github.com/vllm-project/vllm/blob/main/vllm/model_executor/layers/quantization/compressed_tensors/schemes/compressed_tensors_w8a8_int8.py#L121-L124

This is my understanding of the different expected shapes

| Strategy | Scale Shape | ZP Shape |
|----------|-------------|----------|
| Tensor | (1, ) | (1, ) |
| Channel | (out_dims, 1) | (out_dims, 1) |
| Group | (in_dims, out_dims // group_size) | (in_dims, out_dims // group_size) |
| Block | (1, ) | (1, ) |
| Token | (1, 1) | (1, 1) |

I haven't explored block and token quantization much, but the (1, ) shape for block quantization seems suspicious to me. @rahul-tuli, maybe you're familiar?
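
For reference, a hedged sketch of how these expected shapes could be expressed (the helper and strategy names are illustrative, not library API; out_dims, in_dims, and group_size follow the table above):

```python
from typing import Tuple

def expected_qparam_shape(
    strategy: str, out_dims: int, in_dims: int, group_size: int
) -> Tuple[int, ...]:
    # Expected scale / zero-point shape per quantization strategy,
    # mirroring the table above.
    if strategy == "tensor":
        return (1,)
    if strategy == "channel":
        return (out_dims, 1)
    if strategy == "group":
        return (in_dims, out_dims // group_size)
    if strategy == "block":
        return (1,)  # flagged above as possibly incorrect
    if strategy == "token":
        return (1, 1)
    raise ValueError(f"Unknown strategy: {strategy}")
```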

@horheynm (Member) commented

> @dsikka From my reading, it looks like it's just (1, ): https://github.com/vllm-project/vllm/blob/main/vllm/model_executor/layers/quantization/compressed_tensors/schemes/compressed_tensors_w8a8_int8.py#L121-L124
>
> This is my understanding of the different expected shapes
>
> | Strategy | Scale Shape | ZP Shape |
> |----------|-------------|----------|
> | Tensor | (1, ) | (1, ) |
> | Channel | (out_dims, 1) | (out_dims, 1) |
> | Group | (in_dims, out_dims // group_size) | (in_dims, out_dims // group_size) |
> | Block | (1, ) | (1, ) |
> | Token | (1, 1) | (1, 1) |
>
> I haven't explored block and token quantization much, but the (1, ) shape for block quantization seems suspicious to me. @rahul-tuli, maybe you're familiar?

This is helpful; maybe we should include it in the docs somewhere. It's hard to navigate to this thread otherwise.

Inline review comment on test_initialize_quantization_parameters:

```python
),
],
)
def test_initialize_quantization_parameters(weights, input_activations):
```
Member:

General question: don't we need this for output activations?

@kylesayrs (Contributor, Author):

We could test output activations, but I decided not to for test simplicity.

@rahul-tuli mentioned this pull request Nov 27, 2024
@dsikka (Contributor) left a comment
I would also make sure we're mocking correctly, since the test case you pointed out applies the config to initialize scales/zp (which would be impacted by your change), but the mock doesn't seem to care about the shape:
https://github.com/neuralmagic/compressed-tensors/blob/main/tests/conftest.py
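
For illustration, a hedged sketch of a shape-aware check the test could use (the attribute names and helper are assumptions, not the actual conftest.py contents):

```python
import torch

def assert_qparam_shapes(module: torch.nn.Module, expected_shape: tuple) -> None:
    # Illustrative check: verify that initialized scale / zero-point parameters
    # actually have the expected shape, rather than only checking that they exist.
    for name in ("input_scale", "input_zero_point"):
        param = getattr(module, name, None)
        assert param is not None, f"{name} was not initialized"
        assert tuple(param.shape) == expected_shape, (
            f"{name} has shape {tuple(param.shape)}, expected {expected_shape}"
        )
```

For the per-token case above, this would be called as assert_qparam_shapes(layer, (1, 1)).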
