
[PyTorch] Proxy class for low-precision tensor #1127

Merged: 25 commits, Sep 11, 2024

Conversation

@timmoon10 (Collaborator) commented Aug 21, 2024

Description

The Float8Tensor class effectively implements a proxy design pattern: it internally encodes data in FP8 with FP32 scaling factors, but externally presents the interface of a plain PyTorch tensor in FP32/FP16/BF16. This PR generalizes this behavior by moving the proxy logic into an abstract QuantizedTensor class (initially named ProxyTensor and renamed during review). I envision implementing other quantization schemes (e.g. block scaling) by subclassing QuantizedTensor.
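As a rough illustration of the pattern described above (a toy sketch only, not Transformer Engine's actual API — the class and method names below are hypothetical), a quantized tensor stores low-precision data plus an FP32 scaling factor internally and dequantizes on demand:

```python
import torch
from abc import ABC, abstractmethod


class QuantizedTensorSketch(ABC):
    """Toy stand-in for the abstract proxy class.

    The real QuantizedTensor subclasses torch.Tensor so it can pass
    through PyTorch ops transparently; this sketch only shows the
    quantize/dequantize contract.
    """

    @abstractmethod
    def dequantize(self) -> torch.Tensor:
        """Return a plain high-precision (FP32/FP16/BF16) tensor."""


class ToyInt8Tensor(QuantizedTensorSketch):
    """Encodes data as int8 plus an FP32 scale, analogous to FP8 + scale."""

    def __init__(self, data: torch.Tensor, scale: torch.Tensor):
        self._data = data    # int8 payload
        self._scale = scale  # FP32 scaling factor

    @classmethod
    def quantize(cls, t: torch.Tensor) -> "ToyInt8Tensor":
        # Scale so the largest magnitude maps to the int8 range.
        scale = t.abs().max().clamp(min=1e-12) / 127.0
        q = torch.clamp(torch.round(t / scale), -127, 127).to(torch.int8)
        return cls(q, scale)

    def dequantize(self) -> torch.Tensor:
        return self._data.to(torch.float32) * self._scale
```

Other quantization schemes (e.g. block scaling) would then slot in as further subclasses overriding the quantize/dequantize pair, which is the extensibility the PR description is after.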

Type of change

  • Documentation change (change only to the documentation, either a fix or new content)
  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • Infra/Build change
  • Code refactor

Changes

  • Add base class for tensor proxies

Checklist:

  • I have read and followed the contributing guidelines
  • The functionality is complete
  • I have commented my code, particularly in hard-to-understand areas
  • I have made corresponding changes to the documentation
  • My changes generate no new warnings
  • I have added tests that prove my fix is effective or that my feature works
  • New and existing unit tests pass locally with my changes

@timmoon10 (Collaborator, Author)
/te-ci pytorch

@timmoon10 timmoon10 marked this pull request as ready for review August 30, 2024 02:00
@timmoon10 timmoon10 changed the title [WIP] [PyTorch] Proxy class for low-precision tensor [PyTorch] Proxy class for low-precision tensor Aug 30, 2024
@timmoon10 (Collaborator, Author)
/te-ci pytorch

@timmoon10 (Collaborator, Author)
/te-ci pytorch

@timmoon10 (Collaborator, Author)
/te-ci pytorch

@ksivaman (Member)
/te-ci pytorch

@ksivaman (Member) left a comment:

LGTM, pipeline is clean on rerun: 18307147

@ksivaman ksivaman merged commit 2d57db8 into NVIDIA:main Sep 11, 2024
25 of 26 checks passed
yaox12 pushed a commit to yaox12/TransformerEngine that referenced this pull request Sep 12, 2024
* Add base class for tensor proxies

Signed-off-by: Tim Moon <[email protected]>

* Move tensor detaching logic to tensor proxy base class

Signed-off-by: Tim Moon <[email protected]>

* Use Python wrappers to PyTorch extensions

Signed-off-by: Tim Moon <[email protected]>

* Include transpose caching logic in proxy encode function

Signed-off-by: Tim Moon <[email protected]>

* Debug dimension mismatch with amax history

Signed-off-by: Tim Moon <[email protected]>

* Move dequantize logic to proxy_decode func

Signed-off-by: Tim Moon <[email protected]>

* Rename to "QuantizedTensor"

Signed-off-by: Tim Moon <[email protected]>

* Rename "proxy_detach" to "detach"

Signed-off-by: Tim Moon <[email protected]>

* Include transpose cache in detach and clone funcs

Signed-off-by: Tim Moon <[email protected]>

* Fix linter warnings

Signed-off-by: Tim Moon <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Update FP8 workspaces with QuantizedTensor functions

Signed-off-by: Tim Moon <[email protected]>

* Move logic for FP8 transpose cache in FP8 workspaces to base class

Signed-off-by: Tim Moon <[email protected]>

* Remove cast-transpose logic from linear op

Signed-off-by: Tim Moon <[email protected]>

* Remove unnecessary args for Float8Tensor when using FP8 attr dict

Signed-off-by: Tim Moon <[email protected]>

* Remove __torch_function__ to QuantizedTensor

Signed-off-by: Tim Moon <[email protected]>

* Fix linter warnings

Signed-off-by: Tim Moon <[email protected]>

* Update tests/pytorch/test_float8tensor.py

Signed-off-by: Tim Moon <[email protected]>

* Debug FP8 transpose test

Signed-off-by: Tim Moon <[email protected]>

* Debug cast functions

Signed-off-by: Tim Moon <[email protected]>

---------

Signed-off-by: Tim Moon <[email protected]>
Signed-off-by: Tim Moon <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Kirthi Shankar Sivamani <[email protected]>
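Several of the commits above concern carrying an FP8 transpose cache through detach and clone. A minimal sketch of that idea (hypothetical names, not the actual Float8Tensor implementation): FP8 GEMMs often need both the tensor and its transpose, so the proxy computes the transpose lazily, caches it, and preserves the cache across detach/clone rather than recomputing it.

```python
from typing import Optional

import torch


class CachedTransposeSketch:
    """Toy proxy that lazily caches its 2-D transpose across detach/clone."""

    def __init__(self, data: torch.Tensor,
                 transpose_cache: Optional[torch.Tensor] = None):
        self._data = data
        self._t_cache = transpose_cache

    def transpose_2d(self) -> torch.Tensor:
        # Compute once, reuse afterwards.
        if self._t_cache is None:
            self._t_cache = self._data.t().contiguous()
        return self._t_cache

    def detach(self) -> "CachedTransposeSketch":
        # Keep the cache so autograd-detached copies avoid recomputing it.
        return CachedTransposeSketch(self._data.detach(), self._t_cache)

    def clone(self) -> "CachedTransposeSketch":
        cache = None if self._t_cache is None else self._t_cache.clone()
        return CachedTransposeSketch(self._data.clone(), cache)
```

This mirrors the intent of "Include transpose cache in detach and clone funcs": the expensive cast-transpose happens at most once per tensor version, regardless of how many detached or cloned views exist.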
3 participants