
Conversation

Xia-Weiwen (Collaborator):

Fixes #2094 and #2112
Inputs may come with varying shapes and ranks, e.g. when running ResNet18. The current implementation is based on a fixed block_size, which is not enough for such cases. The fix is simple: use block_size = -1 for each dimension for per-tensor quantization, and update block_size for each input when inserting q/dq in convert.
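A minimal, self-contained sketch of the idea (the helper name `_block_size_for_input` is made up for illustration and is not the PR's actual code): per-tensor observers store -1 for every dimension, and the effective block_size is recomputed from each input's shape when q/dq is inserted.

```python
import torch

# Hypothetical helper, for illustration only (not the torchao implementation):
# a stored block_size of all -1 marks per-tensor quantization, and the concrete
# block_size is derived from the shape of each incoming input.
def _block_size_for_input(stored_block_size, input_shape):
    if all(b == -1 for b in stored_block_size):
        # Per-tensor: the whole tensor is one block, whatever its rank.
        return tuple(input_shape)
    # Otherwise, expand any -1 entries to the full size of that dimension.
    return tuple(d if b == -1 else b for b, d in zip(stored_block_size, input_shape))

# Inputs of different shapes and ranks (as seen inside ResNet18) each get a
# valid block_size instead of reusing one frozen at prepare time.
print(_block_size_for_input((-1, -1, -1, -1), torch.randn(1, 3, 224, 224).shape))  # (1, 3, 224, 224)
print(_block_size_for_input((-1, -1, -1, -1), torch.randn(1, 512).shape))          # (1, 512)
```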


pytorch-bot bot commented May 6, 2025

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/2177

✅ No failures as of commit b03b1e6 with merge base 07ca637.

facebook-github-bot added the "CLA Signed" label on May 6, 2025
Xia-Weiwen added the "topic: not user facing" label on May 6, 2025
Xia-Weiwen changed the title from "[PT2E] Fix per-tensor observer issue with varing shape & rank" to "[PT2E] Fix per-tensor observer issue with varying shape & rank" on May 6, 2025
Xia-Weiwen force-pushed the fix_per_tensor_quant branch from 87f1249 to 2ac41fb on May 6, 2025
"Expecting original_dtype to be populated"
)
# Since input shape & rank may change (e.g. Resnet18), here we need to update block_size for each input
self.block_size = get_block_size(

jerryzh168 (Contributor) commented May 6, 2025:

When does this happen? Can you give an example? I thought using -1 for dynamic dimensions would be enough?

Xia-Weiwen (Collaborator, Author):

To reproduce the issue, you can run the code here: #2094 (comment)
You would have to use -1 for block_size without the update to self.block_size here.
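For a self-contained illustration of the failure mode (this is not the repro from #2094; the `_FixedBlockSizeObserver` class is made up for the example): an observer whose block_size is frozen at prepare time trips its rank check as soon as an input of a different rank arrives, e.g. the 4-D conv activations vs. the 2-D input of the final linear layer in ResNet18.

```python
import torch

# Toy observer, for illustration only: block_size is fixed at construction,
# so the rank check fails once the input rank changes.
class _FixedBlockSizeObserver:
    def __init__(self, block_size):
        self.block_size = block_size  # e.g. (-1, -1, -1, -1) chosen for a 4-D input

    def __call__(self, x):
        assert len(self.block_size) == len(x.shape), (
            f"block_size rank {len(self.block_size)} != input rank {len(x.shape)}"
        )
        return x

obs = _FixedBlockSizeObserver((-1, -1, -1, -1))
obs(torch.randn(1, 3, 224, 224))  # passes: ranks match

try:
    obs(torch.randn(1, 512))  # 2-D input, e.g. the flattened FC input
except AssertionError as e:
    print("observer failed:", e)
```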

Contributor:

Are you saying the rank / number of dimensions changes for the input as well? Can we use a single -1 to represent this case?

Xia-Weiwen (Collaborator, Author):

> Are you saying the rank / number of dimensions changes for the input as well?

Yes.

> Can we use a single -1 to represent this case?

I think it's doable, but there are checks that guard len(self.block_size) == len(input.shape). We would need to handle the special case for per-tensor quant at those locations (see the sketch below). Is that OK?
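For illustration, a sketch of what that special-casing at the guard sites could look like if a bare `(-1,)` sentinel were used for per-tensor quantization (hypothetical helper, not what this PR implements):

```python
# Every guard of the form len(block_size) == len(input.shape) would need to
# short-circuit on the hypothetical per-tensor sentinel before comparing ranks.
def _check_block_size(block_size, input_shape):
    if tuple(block_size) == (-1,):
        # Per-tensor sentinel: the whole tensor is one block, skip the rank check.
        return
    assert len(block_size) == len(input_shape), (
        f"block_size rank {len(block_size)} does not match input rank {len(input_shape)}"
    )

_check_block_size((-1,), (1, 3, 224, 224))         # passes: per-tensor sentinel
_check_block_size((1, 3, 8, 8), (1, 3, 224, 224))  # passes: ranks match
```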

Xia-Weiwen requested a review from jerryzh168 on May 8, 2025

Xia-Weiwen (Collaborator, Author):

@jerryzh168 Could you please review this PR? Thanks.

Xia-Weiwen requested a review from drisspg on May 13, 2025
Xia-Weiwen marked this pull request as ready for review on May 13, 2025

Xia-Weiwen (Collaborator, Author):

Hi @jerryzh168 @drisspg, could you please review this PR? I am not sure whether the current implementation is what you expected. Thanks.

shape_for_reduction: (3, 3, 5, 2, 10)
reduction_dim: [0, 1, 3, 4]
"""
assert len(block_size) == len(input_size)

Contributor:

Is this still used? We should be using the code in quant_primitives.py, I think.

Xia-Weiwen (Collaborator, Author):

Yes. It's still used when running the prepared model (model after prepare_pt2e). Is it a bug? Do I need to fix it, too?

Xia-Weiwen (Collaborator, Author):

I am using the observers defined here: torchao/quantization/pt2e/_affine_quantization.py

Xia-Weiwen (Collaborator, Author):

Hi @jerryzh168 May I know your suggestion on this? Thanks.

Contributor:

I think we should be using the ones in torchao/quantization/observer.py eventually. The only occurrence seems to be AffineQuantizedMinMaxObserver, and we want to update it, I think. So if you are adding new things, I'd recommend using the one from torchao.quantization.

Contributor:

@Xia-Weiwen Sorry for the delay; please feel free to work on this.

Contributor:

I thought we already use the one from torchao, but if you saw that we are using torch.ao, please go ahead and change them.

Xia-Weiwen (Collaborator, Author):

@jerryzh168 Thanks for the reply. I did not mean torch.ao. I meant there are two versions of such utilities in torchao: torchao.quantization.pt2e and torchao.quantization. For example, class PartialWrapper and class _PartialWrapper.
The PT2E flow in torchao uses those in torchao.quantization.pt2e, while you said you wanted to switch to torchao/quantization/observer.py.
So, I was asking whether you would switch to torchao/quantization/observer.py in the PT2E flow first. Do you have any suggestions on that? Thanks.

Contributor:

Oh, I see. Yeah, for now, using torchao/quantization/observer.py would be better, I think; we haven't finalized the folder structure for this one yet.

Xia-Weiwen (Collaborator, Author):

@jerryzh168 Am I supposed to wait until you finalize the folder structure? Thanks.

jerryzh168 (Contributor):

It would be good to add a test for this.

Xia-Weiwen (Collaborator, Author):

This PR is stale, and switching to torchao's observer for PT2E does not look like an urgent task. So let me close it and open a new one if needed in the future.

Xia-Weiwen closed this on August 4, 2025
Xia-Weiwen deleted the fix_per_tensor_quant branch on August 4, 2025

Labels: CLA Signed; topic: not user facing

Linked issue: [Quant][PT2E] AffineQuantized observers failed Resnet18