Fix QKV dtype in the bwd of FP8+CP #1134

xrennvidia · 2024-08-26T08:03:23Z

Description

Fix the QKV dtype in the bwd of FP8+CP.

Type of change

Documentation change (change only to the documentation, either a fix or a new content)
Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
Infra/Build change
Code refractor

Changes

Please list the changes introduced in this PR:

Change A
Change B

Checklist:

I have read and followed the contributing guidelines
The functionality is complete
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
My changes generate no new warnings
I have added tests that prove my fix is effective or that my feature works
New and existing unit tests pass locally with my changes

Signed-off-by: Xiaowei Ren <[email protected]>

cyanguwa · 2024-08-26T18:01:47Z

Can we test both the E4M3 and HYBRID fp8 recipes please? Can we add that to the unit tests?

xrennvidia · 2024-08-26T18:33:49Z

Can we test both the E4M3 and HYBRID fp8 recipes please? Can we add that to the unit tests?

This is easy, we just need to change a flag of FP8 recipe, but I think this probably will not add anything additional to the test. I am comparing the results of CP>1 vs. CP=1. E4M2 and HYBRID share the same code, and I think HYBRID test can cover everything of E4M3. For example the bug in this PR is for HYBRID only, it should work for E4M3 because fwd and bwd dtype are same.

Maybe I am misunderstanding your point. In your mind, do you have anything special of E4M3 that cannot be covered by HYBRID test?

Signed-off-by: Xiaowei Ren <[email protected]>

for more information, see https://pre-commit.ci

Signed-off-by: Xiaowei Ren <[email protected]>

xrennvidia · 2024-08-28T21:17:47Z

/te-ci pytorch

Signed-off-by: Xiaowei Ren <[email protected]>

xrennvidia · 2024-08-29T22:06:35Z

/te-ci pytorch

xrennvidia · 2024-08-29T22:08:57Z

/te-ci pytorch

cyanguwa

LGTM

* fix qkv_dtype of FP8+CP Signed-off-by: Xiaowei Ren <[email protected]> * config cp correction dtype of FP8+CP Signed-off-by: Xiaowei Ren <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * code style change Signed-off-by: Xiaowei Ren <[email protected]> * always do FP8 CP correction in FP32 Signed-off-by: Xiaowei Ren <[email protected]> --------- Signed-off-by: Xiaowei Ren <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Charlene Yang <[email protected]>

xrennvidia added 2 commits August 23, 2024 13:31

fix qkv_dtype of FP8+CP

7c926f5

Signed-off-by: Xiaowei Ren <[email protected]>

Merge branch 'main' into xren/cp_fp8_fix

80c4af5

cyanguwa assigned cyanguwa and unassigned cyanguwa Aug 26, 2024

cyanguwa self-requested a review August 26, 2024 18:02

xrennvidia and others added 4 commits August 26, 2024 22:34

config cp correction dtype of FP8+CP

184ad60

Signed-off-by: Xiaowei Ren <[email protected]>

[pre-commit.ci] auto fixes from pre-commit.com hooks

e383bbe

for more information, see https://pre-commit.ci

code style change

62e620a

Signed-off-by: Xiaowei Ren <[email protected]>

Merge branch 'main' into xren/cp_fp8_fix

bda0506

ksivaman added the 1.10.0 label Aug 28, 2024

always do FP8 CP correction in FP32

24c626d

Signed-off-by: Xiaowei Ren <[email protected]>

Merge branch 'main' into xren/cp_fp8_fix

5b5d8d7

cyanguwa approved these changes Aug 29, 2024

View reviewed changes

Merge branch 'main' into xren/cp_fp8_fix

8e34de1

cyanguwa merged commit 9437ceb into NVIDIA:main Aug 30, 2024
15 checks passed

xrennvidia deleted the xren/cp_fp8_fix branch August 30, 2024 17:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix QKV dtype in the bwd of FP8+CP #1134

Fix QKV dtype in the bwd of FP8+CP #1134

xrennvidia commented Aug 26, 2024 •

edited

Loading

cyanguwa commented Aug 26, 2024

xrennvidia commented Aug 26, 2024

xrennvidia commented Aug 28, 2024

xrennvidia commented Aug 29, 2024

xrennvidia commented Aug 29, 2024

cyanguwa left a comment

Fix QKV dtype in the bwd of FP8+CP #1134

Fix QKV dtype in the bwd of FP8+CP #1134

Conversation

xrennvidia commented Aug 26, 2024 • edited Loading

Description

Type of change

Changes

Checklist:

cyanguwa commented Aug 26, 2024

xrennvidia commented Aug 26, 2024

xrennvidia commented Aug 28, 2024

xrennvidia commented Aug 29, 2024

xrennvidia commented Aug 29, 2024

cyanguwa left a comment

Choose a reason for hiding this comment

xrennvidia commented Aug 26, 2024 •

edited

Loading