feat: Add NPU and XPU support for activation offloading #4056
What does this PR do?
This PR extends the `OffloadActivations` context manager to enable activation offloading on Intel XPU and Huawei Ascend NPU devices, in addition to the existing CUDA support. The key changes include:

- Detecting the accelerator type at runtime (`xpu`, `npu`, or `cuda`).
- Using the device-specific stream APIs (`torch.npu.Stream`, `torch.xpu.current_stream`).
- Fixing a `TypeError` on non-CUDA devices by using the proper stream context managers (`torch.npu.stream` and `torch.xpu.stream`) instead of attempting to use the raw stream object.

This allows users on NPU and XPU hardware to leverage the memory savings provided by activation offloading, improving their ability to train larger models.
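For illustration, here is a minimal sketch of what such backend dispatch can look like. This is not the PR's actual diff: the helper names `detect_accelerator` and `stream_context` are hypothetical, and the `torch.npu` namespace assumes the `torch_npu` extension is installed. `torch.cuda.stream`, `torch.xpu.stream`, and `torch.npu.stream` are the backend-specific context-manager wrappers the description above refers to.

```python
import torch


def detect_accelerator() -> str:
    # Hypothetical helper: report which supported backend is available.
    if torch.cuda.is_available():
        return "cuda"
    if hasattr(torch, "xpu") and torch.xpu.is_available():
        return "xpu"
    # The torch.npu namespace exists only after the torch_npu extension
    # has been installed and imported.
    if hasattr(torch, "npu") and torch.npu.is_available():
        return "npu"
    raise RuntimeError("No supported accelerator (cuda/xpu/npu) found")


def stream_context(device_type: str, stream):
    # On non-CUDA backends, passing a raw Stream object where a context
    # manager is expected raises a TypeError, so each backend's
    # stream(...) wrapper is used instead.
    if device_type == "cuda":
        return torch.cuda.stream(stream)
    if device_type == "xpu":
        return torch.xpu.stream(stream)
    if device_type == "npu":
        return torch.npu.stream(stream)
    raise ValueError(f"Unsupported device type: {device_type}")
```

With helpers like these, offload copies can be issued on a side stream in a backend-agnostic way, e.g. `with stream_context(dev, side_stream): cpu_tensor.copy_(gpu_tensor, non_blocking=True)`, rather than branching on CUDA-only APIs at each call site.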
Before submitting

- Did you read the contributor guideline, Pull Request section?
- Was this discussed/approved via a GitHub issue? Please add a link to it if that's the case.
- Ran `test_activation_offloading.py` on Ascend NPU devices.

Who can review?
Anyone in the community is free to review the PR once the tests have passed. Feel free to tag members/contributors who may be interested in your PR.