New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Add log1p elementwise op #993

Open

22quinn wants to merge 1 commit into facebookincubator:main from 22quinn:export-D54176180

Contributor

22quinn commented Feb 26, 2024

Summary:
log1p(x) is more precise than log(1+x) when x is close to 0. We utilize cuda log1pf implementation for fp32. For other precision types, input is first converted to float, then log1pf is computed, finally output is converted back to original precision.

CUDA log1pf function for float and double: https://docs.nvidia.com/cuda/cuda-math-api/group__CUDA__MATH__SINGLE.html

Differential Revision: D54176180

facebook-github-bot added the CLA Signed label

Contributor

facebook-github-bot commented Feb 26, 2024

This pull request was exported from Phabricator. Differential Revision: D54176180

facebook-github-bot added the fb-exported label

22quinn force-pushed the export-D54176180 branch from b7a9b04 to c938fba Compare

March 1, 2024 01:39

Contributor

facebook-github-bot commented Mar 1, 2024

This pull request was exported from Phabricator. Differential Revision: D54176180

22quinn force-pushed the export-D54176180 branch from c938fba to 63ad837 Compare

March 1, 2024 01:52

Contributor

facebook-github-bot commented Mar 1, 2024

This pull request was exported from Phabricator. Differential Revision: D54176180

22quinn force-pushed the export-D54176180 branch from 63ad837 to 0902435 Compare

March 1, 2024 01:52

Contributor

facebook-github-bot commented Mar 1, 2024

This pull request was exported from Phabricator. Differential Revision: D54176180


          Add log1p elementwise op

23e93f2

Summary:
`log1p(x)` is more precise than `log(1+x)` when `x` is close to 0. We utilize cuda `log1pf` implementation for fp32. For other precision types, input is first converted to float, then `log1pf` is computed, finally output is converted back to original precision.

CUDA log1pf function for float and double: https://docs.nvidia.com/cuda/cuda-math-api/group__CUDA__MATH__SINGLE.html

Reviewed By: frank-wei

Differential Revision: D54176180

22quinn force-pushed the export-D54176180 branch from 0902435 to 23e93f2 Compare

March 1, 2024 05:27

Contributor

facebook-github-bot commented Mar 1, 2024

This pull request was exported from Phabricator. Differential Revision: D54176180

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CLA Signed fb-exported