[AMD] Fix and enable upcasting fp8e4m3nv to fp16 #5604

knwng · 2025-01-14T05:12:20Z

This commit fixed and enabled fp8e4m3nv to fp16 upcasting.
In order to properly handle denorms with LUT, this operation is no longer vectorized.

New contributor declaration

I am not making a trivial change, such as fixing a typo in a comment.
I have written a PR description following these
rules.
I have run pre-commit run --from-ref origin/main --to-ref HEAD.
Select one of the following.
- I have added tests.
  - /test for lit tests
  - /unittest for C++ tests
  - /python/test for end-to-end tests
- This PR does not need a test because FILL THIS IN.
Select one of the following.
- I have not added any lit tests.
- The lit tests I have added follow these best practices,
  including the "tests should be minimal" section. (Usually running Python code
  and using the instructions it generates is not minimal.)

This commit fixed and enabled fp8e4m3nv to fp16 upcasting. In order to properly handle denorms with LUT, this operation is no longer vectorized.

antiagainst

Can you also turn on the test by dropping these lines

https://github.com/triton-lang/triton/blob/3bac3be/python/test/unit/language/test_core.py#L3541-L3542

[AMD] Fix and enable upcasting fp8e4m3nv to fp16

6c6afa0

This commit fixed and enabled fp8e4m3nv to fp16 upcasting. In order to properly handle denorms with LUT, this operation is no longer vectorized.

antiagainst marked this pull request as ready for review January 14, 2025 19:35

antiagainst requested review from antiagainst, zhanglx13 and ptillet as code owners January 14, 2025 19:35

antiagainst requested changes Jan 14, 2025

View reviewed changes

antiagainst added 2 commits January 15, 2025 00:34

Enable scaled dot tests

c1384c6

Merge remote-tracking branch 'origin/main' into fp8e4m3_to_fp16_new

33d079b

antiagainst approved these changes Jan 15, 2025

View reviewed changes

antiagainst merged commit aa833c9 into triton-lang:main Jan 15, 2025
7 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[AMD] Fix and enable upcasting fp8e4m3nv to fp16 #5604

[AMD] Fix and enable upcasting fp8e4m3nv to fp16 #5604

knwng commented Jan 14, 2025

antiagainst left a comment

[AMD] Fix and enable upcasting fp8e4m3nv to fp16 #5604

[AMD] Fix and enable upcasting fp8e4m3nv to fp16 #5604

Conversation

knwng commented Jan 14, 2025

New contributor declaration

antiagainst left a comment

Choose a reason for hiding this comment