Update the rowwise adagrad optimizer to leverage optimizer state offloading, v3 #4133

Open: wants to merge 1 commit into main

q10 (Contributor) commented May 15, 2025

Summary:

  • Add compile-time flag to enable working with optimizer offloading
  • Update the rowwise adagrad optimizer to leverage optimizer state offloading
    if the optimizer is TBE SSD, and if the user specified it
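
For illustration only, here is a minimal C++ sketch of a rowwise adagrad step in which the per-row state is read either from a separate state buffer or from a slot at the end of the row itself. The function names, the `[weights..., state]` row layout, and the `offloaded` argument are assumptions made for this sketch; the actual kernel in this PR is CUDA and may lay out its data differently.

```cpp
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical helper: one rowwise adagrad step on a single embedding row.
// Rowwise adagrad keeps one accumulator per row (not per element):
//   state += mean(grad^2);  weights -= lr * grad / (sqrt(state) + eps)
inline void rowwise_adagrad_step(float* weights, const float* grad,
                                 std::size_t dim, float* state,
                                 float lr, float eps) {
  float sum_sq = 0.0f;
  for (std::size_t i = 0; i < dim; ++i) {
    sum_sq += grad[i] * grad[i];
  }
  *state += sum_sq / static_cast<float>(dim);
  const float step = lr / (std::sqrt(*state) + eps);
  for (std::size_t i = 0; i < dim; ++i) {
    weights[i] -= step * grad[i];
  }
}

// Hypothetical wrapper: picks where the row's accumulator lives.
// Assumed offloaded layout: the row buffer holds `dim` weights followed by
// one state slot. Otherwise the state sits in a separate per-row array.
inline void update_row(std::vector<float>& row, std::size_t dim,
                       const float* grad,
                       std::vector<float>& separate_state,
                       std::size_t row_idx, bool offloaded,
                       float lr, float eps) {
  float* state = offloaded ? &row[dim] : &separate_state[row_idx];
  rowwise_adagrad_step(row.data(), grad, dim, state, lr, eps);
}
```

With the state stored next to the weights, a single row fetch brings in everything the update needs, which is the appeal of this layout for an SSD-backed table.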

Differential Revision: D74827718

netlify bot commented May 15, 2025

Deploy Preview for pytorch-fbgemm-docs ready!

Name Link
🔨 Latest commit acacb25
🔍 Latest deploy log https://app.netlify.com/projects/pytorch-fbgemm-docs/deploys/682d5d0b1f33330008a0b8ac
😎 Deploy Preview https://deploy-preview-4133--pytorch-fbgemm-docs.netlify.app

@facebook-github-bot (Contributor)

This pull request was exported from Phabricator. Differential Revision: D74827718

q10 added a commit to q10/FBGEMM that referenced this pull request May 16, 2025
…oading, v3, backend (pytorch#4133)

Summary:

X-link: facebookresearch/FBGEMM#1214

This diff adds support for updating optimizer state through optimizer state offloading, starting with the rowwise adagrad optimizer.

- Add compile-time flag `kEnableOptimizerOffloading` to the table update kernel to enable handling optimizer offloading, starting with the rowwise adagrad case
- Propagate the compile-time flag upwards to `embedding_backward_split_template.cu`, where it is set from a runtime, user-supplied boolean argument
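
The two bullets above describe a common pattern: a runtime boolean selects between two template instantiations, so the kernel itself sees a compile-time constant. A minimal host-only C++ sketch of that pattern follows. Only the name `kEnableOptimizerOffloading` comes from this PR; `state_offset`, `dispatch_state_offset`, and the offsets they return are hypothetical stand-ins, not the real kernel or its signature.

```cpp
#include <cstddef>

// Hypothetical stand-in for the table update kernel. The flag is a template
// parameter, so each instantiation contains only one of the two branches
// after constant folding.
template <bool kEnableOptimizerOffloading>
std::size_t state_offset(std::size_t row_idx, std::size_t dim) {
  // Assumed layouts for this sketch:
  //   offloading on:  state sits in the row itself, right after `dim` weights
  //   offloading off: state sits in a separate array indexed by row
  return kEnableOptimizerOffloading ? dim : row_idx;
}

// Hypothetical host-side dispatch: the user-supplied runtime boolean is
// turned into the compile-time flag by choosing an instantiation.
inline std::size_t dispatch_state_offset(bool enable_optimizer_offloading,
                                         std::size_t row_idx,
                                         std::size_t dim) {
  return enable_optimizer_offloading
      ? state_offset<true>(row_idx, dim)
      : state_offset<false>(row_idx, dim);
}
```

The cost of this design is that both instantiations are compiled; the benefit is that the hot path carries no per-row runtime check.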

Differential Revision: D74827718
q10 force-pushed the export-D74827718 branch from 3c6130d to 442366d on May 16, 2025 23:26
q10 added a commit to q10/FBGEMM that referenced this pull request May 16, 2025
…oading, v3, backend (pytorch#4133)
q10 force-pushed the export-D74827718 branch from 442366d to e39b485 on May 16, 2025 23:30
q10 added a commit to q10/FBGEMM that referenced this pull request May 20, 2025
…oading, v3, backend (pytorch#4133)
q10 force-pushed the export-D74827718 branch from e39b485 to 758f8e0 on May 20, 2025 21:40
q10 force-pushed the export-D74827718 branch from 758f8e0 to cc37c8c on May 21, 2025 04:48
q10 added a commit to q10/FBGEMM that referenced this pull request May 21, 2025
…oading, v3, backend (pytorch#4133)
q10 added a commit to q10/FBGEMM that referenced this pull request May 21, 2025
…oading, v3, backend (pytorch#4133)
q10 force-pushed the export-D74827718 branch from cc37c8c to 8a5ddd3 on May 21, 2025 04:49
q10 added a commit to q10/FBGEMM that referenced this pull request May 21, 2025
…oading, v3, backend (pytorch#4133)
q10 force-pushed the export-D74827718 branch from 8a5ddd3 to 5e2d988 on May 21, 2025 04:51
q10 force-pushed the export-D74827718 branch from 5e2d988 to acacb25 on May 21, 2025 04:56