Enable fp16/bf16 absmax #1672

jiqing-feng · 2025-06-06T06:14:10Z

Hi @matthewdouglas, enabling fp16/bf16 absmax on XPU gives about a 20% speed-up on our QLoRA case. Please review it. I am checking whether any tests fail on CUDA and will let you know once that is done. BTW, there are a lot of tests...

Signed-off-by: jiqing-feng <[email protected]>

jiqing-feng · 2025-06-09T03:58:32Z

Hi @matthewdouglas. I kept the CUDA op the same as before and only enabled half-precision absmax on CPU/XPU. This PR passes all CUDA tests on an A100 and all CPU tests on an Intel Xeon node. On XPU, around 20 tests fail because of a compile error, but that failure is not introduced by this PR. So please review this PR. Thanks!

Signed-off-by: jiqing-feng <[email protected]>
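
For readers unfamiliar with the term, here is a minimal pure-Python sketch of what a blockwise absmax is and what storing it in fp16 does to its value. It is only an illustration, not the bitsandbytes kernel: the helper names `to_fp16` and `blockwise_absmax` are hypothetical, and fp16 rounding is emulated with the standard `struct` module's `e` (half-precision) format.

```python
import struct


def to_fp16(x: float) -> float:
    """Round a Python float to the nearest IEEE 754 half-precision value."""
    return struct.unpack("<e", struct.pack("<e", x))[0]


def blockwise_absmax(values, blocksize):
    """Return the maximum absolute value of each consecutive block."""
    return [
        max(abs(v) for v in values[i:i + blocksize])
        for i in range(0, len(values), blocksize)
    ]


values = [0.1, -0.7, 0.25, 0.3331, -0.05, 0.02]

# One scale per block, kept as ordinary Python floats (double precision).
absmax_full = blockwise_absmax(values, blocksize=3)

# The same scales after rounding to half precision (hypothetical stand-in
# for keeping absmax in fp16 instead of a wider dtype).
absmax_fp16 = [to_fp16(a) for a in absmax_full]
```

The half-precision scales differ slightly from the full-precision ones, which is one reason a test threshold tuned for wider absmax values may need revisiting.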

github-actions · 2025-06-09T16:43:48Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

jiqing-feng · 2025-06-13T02:07:37Z

tests/test_functional.py

@@ -137,11 +135,10 @@ def test_dynamic_blockwise_quantization(self, device, dtype, nested, blocksize,
        abserr = sum(diffs) / len(diffs)
        relerr = sum(reldiffs) / len(reldiffs)
        if signed:
-            threshold_abserr = 0.0036 if device in ("cpu", "xpu") and (F.ipex_cpu or F.ipex_xpu) else 0.0035
            assert abserr < 0.0036


This line is removed because threshold_abserr is never used.

jiqing-feng · 2025-06-13T02:08:33Z

tests/test_functional.py

            assert abserr < 0.0036
            assert relerr < 0.015
        else:
-            assert abserr < 0.00175 if device in ("cpu", "xpu") and (F.ipex_cpu or F.ipex_xpu) else 0.0023
+            assert abserr < 0.0023


There is no reason to keep a tighter threshold for IPEX; with it, the half-precision check cannot pass.
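
A side note on the removed assertion, offered as a small sketch rather than as the PR author's reasoning: in Python, a conditional expression binds more loosely than `<`, so `assert abserr < 0.00175 if cond else 0.0023` parses as `assert ((abserr < 0.00175) if cond else 0.0023)`. When the condition is false, the assertion evaluates the non-zero constant `0.0023` and always passes. The helper names `check_old` and `check_new` below are hypothetical.

```python
def check_old(abserr, use_ipex):
    # Mirrors the removed line. This is parsed as
    # `(abserr < 0.00175) if use_ipex else 0.0023`,
    # not as `abserr < (0.00175 if use_ipex else 0.0023)`.
    return abserr < 0.00175 if use_ipex else 0.0023


def check_new(abserr):
    # Mirrors the replacement line: a plain comparison.
    return abserr < 0.0023


big_error = 1.0

# Without IPEX the old form returns the constant 0.0023, which is truthy,
# so an `assert` on it can never fail regardless of the error.
old_result = check_old(big_error, use_ipex=False)

# The new form actually compares the error against the threshold.
new_result = check_new(big_error)
```

Under this reading, the single-threshold form also makes the non-IPEX branch a real check.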

jiqing-feng · 2025-06-17T05:58:29Z

Detected a conflict with the XPU SYCL path; holding this PR until the XPU SYCL path is merged.

jiqing-feng marked this pull request as ready for review June 6, 2025 08:50

jiqing-feng added 9 commits June 6, 2025 12:40

enable fp16/bf16 absmax

6e55c34

Signed-off-by: jiqing-feng <[email protected]>

fix absmax dtype

73543ef

Signed-off-by: jiqing-feng <[email protected]>

fix ipex op

18f9715

Signed-off-by: jiqing-feng <[email protected]>

fx tests

6f48548

Signed-off-by: jiqing-feng <[email protected]>

fix ipex input dtype

c4b3cca

Signed-off-by: jiqing-feng <[email protected]>

fix meta register dtype

d24f47d

Signed-off-by: jiqing-feng <[email protected]>

fix test threshold

3ce11b3

Signed-off-by: jiqing-feng <[email protected]>

revert mistake change

8799041

Signed-off-by: jiqing-feng <[email protected]>

Merge branch 'main' into absmax

50ee994

jiqing-feng force-pushed the absmax branch 2 times, most recently from 69b2146 to 50ee994 Compare June 9, 2025 02:45

keep cuda op

daad33d

Signed-off-by: jiqing-feng <[email protected]>

matthewdouglas added the Intel label Jun 9, 2025

matthewdouglas added this to the v0.47.0 milestone Jun 9, 2025

Merge branch 'main' into absmax

cb0756e

jiqing-feng commented Jun 13, 2025

jiqing-feng marked this pull request as draft June 17, 2025 05:58
