Feat (proxy): flag to enable/disable QT return #1083

Giuseppe5 · 2024-10-31T17:25:18Z

Reason for this PR

Because quant metadata is cached during export, QuantTensors do not need to be propagated during export.
This will come in handy for future features, including lazy dequantization, which assumes a different paradigm for inference than for export.

Changes Made in this PR

Return a plain tensor (not a QuantTensor) from proxies during export.
Set every return_quant_tensor flag to False during tracing, and restore the original values immediately afterwards.
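
For readers unfamiliar with the save/disable/restore pattern described above, here is a minimal sketch of it. It is not the code from this PR: `ToyQuantLayer` and `quant_tensor_return_disabled` are hypothetical names made up for illustration, and the only thing assumed about a layer is that it carries a boolean `return_quant_tensor` attribute.

```python
from contextlib import contextmanager


class ToyQuantLayer:
    """Hypothetical stand-in for a quantized layer; not a Brevitas class."""

    def __init__(self, name, return_quant_tensor=False):
        self.name = name
        self.return_quant_tensor = return_quant_tensor


@contextmanager
def quant_tensor_return_disabled(layers):
    """Force return_quant_tensor to False inside the block, then restore it."""
    layers = list(layers)
    # Remember each layer's original setting so it can be restored exactly.
    saved = [layer.return_quant_tensor for layer in layers]
    try:
        for layer in layers:
            layer.return_quant_tensor = False
        yield layers
    finally:
        # Restore the saved flags even if tracing raises an exception.
        for layer, flag in zip(layers, saved):
            layer.return_quant_tensor = flag


layers = [
    ToyQuantLayer("conv", return_quant_tensor=True),
    ToyQuantLayer("act", return_quant_tensor=False),
    ToyQuantLayer("pool", return_quant_tensor=True),
]

with quant_tensor_return_disabled(layers):
    # Tracing/export would run here; every flag is False at this point.
    flags_during_trace = [layer.return_quant_tensor for layer in layers]

# Outside the block, each layer has its original setting back.
flags_after_trace = [layer.return_quant_tensor for layer in layers]
```

Saving the per-layer values, rather than resetting everything to a single default, is what lets layers that were already configured with `return_quant_tensor=False` stay that way after tracing.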

Testing Summary

NA

Risk Highlight

This PR includes code from another work (please detail).
This PR contains API-breaking changes.
This PR depends on work in another PR (please provide links/details).
This PR introduces new dependencies (please detail).
There are coverage gaps not covered by tests.
Documentation updates required in subsequent PR.

Checklist

Code comments added to any hard-to-understand areas, if applicable.
Changes generate no new warnings.
Updated any relevant tests, if applicable.
No conflicts with destination dev branch.
I reviewed my own code changes.
Initial CI/CD passing.
1+ reviews given, and any review issues addressed and approved.
Post-review full CI/CD passing.

nickfraser

See the comment. If you think this is the best approach then approved - please merge (after FINN integration is fixed!)

src/brevitas/nn/quant_avg_pool.py

Giuseppe5 force-pushed the export_change branch from 21f055d to 84c4309 on November 28, 2024 15:11

Giuseppe5 force-pushed the export_change branch from 2e273bf to 3ccf130 on December 17, 2024 10:13

Giuseppe5 changed the title ~~Feat (export): dequantize during export~~ Feat (proxy): flag to enable/disable QT return Dec 17, 2024

Giuseppe5 added 7 commits December 17, 2024 14:11

Feat (proxy): flag to enable/disable QT return (3b27bee)
Update runtime_quant.py (026bb3f)
fix parameter proxy (b8094b9)
conflicting flags (a058333)
fix (da58e2f)
bugfixes (895ce80)
Update (b8c4877)

Giuseppe5 force-pushed the export_change branch from 839feb0 to b8c4877 on December 17, 2024 14:24

fix (a5e06a4)

nickfraser approved these changes Dec 17, 2024

src/brevitas/nn/quant_avg_pool.py (resolved)

Giuseppe5 requested a review from nickfraser December 17, 2024 17:19

Giuseppe5 merged commit 3612e90 into Xilinx:dev Dec 18, 2024
390 of 396 checks passed
