Docs: Update Math API page #3738

adeljo-amd · 2025-02-03T16:02:55Z

The goal of this PR is to improve the usability of the Math API page, by providing information about each function's maximum ULP error when compared against the C++ standard library (if applicable). It also highlights specific math functions which are unsupported too.

docs/reference/math_api.rst

randyh62

Looks good to me, except for the missing data related to the functions that don't run on GPUs.

docs/reference/math_api.rst

neon60 · 2025-02-07T09:01:54Z

@adeljo-amd Could you please rebase the branch?

adeljo-amd · 2025-02-07T09:10:08Z

@adeljo-amd Could you please rebase the branch?

Done

adeljo-amd · 2025-02-19T12:58:00Z

@neon60 Updated and added test ranges too

g-h-c · 2025-02-20T12:06:54Z

I find the explanation of ffs() might lack information. ffs() returns 0 if the integer has no bit set to 1. i.e.

ffs(0) returns 0
ffs(x) returns the position of the first bit set in x +1

Also should we warn that fns() calls are potentially slower? (as the compiler does not have a intrinsic for them, unlike ffs()):

ffs() uses the ff1 instruction present both in RDNA and CDNA. See https://godbolt.org/z/3ncG3E99o
fns() has no equivalent intrinsic, and it is implemented with a loop in the CLR.

adeljo-amd · 2025-02-20T12:24:40Z

I find the explanation of ffs() might lack information. ffs() returns 0 if the integer has no bit set to 1. i.e.

ffs(0) returns 0

ffs(x) returns the position of the first bit set in x +1

Also should we warn that fns() calls are potentially slower? (as the compiler does not have a intrinsic for them, unlike ffs()):

ffs() uses the ff1 instruction present both in RDNA and CDNA. See https://godbolt.org/z/3ncG3E99o

fns() has no equivalent intrinsic, and it is implemented with a loop in the CLR.

@g-h-c These are fair suggestions - it probably won't be necessary to mention that ffs() calls ff1, but it would be good to highlight that fns() is implemented via software rather than via hardware intrinsics.

Besides updating the ffs() description, I think it would be good to align this with ffsll(), which I imagine has the same behavior

adeljo-amd · 2025-02-21T08:58:41Z

@g-h-c Fixed, let me know what you think

jujiang-del · 2025-02-25T20:23:49Z

docs/reference/math_api.rst

+math functions listed below are available on the device side.
+
+Arithmetic
+----------


Looked through math APIs, please double check the following,
Supported functions, missing:
float fdividef(float x, float y)
Divide two floating point values.

float fmaf(float x, float y, float z)
Returns x⋅y+z as a single operation.

float fmaxf(float x, float y)
Determine the maximum numeric value of x and y.

float fminf(float x, float y)
Determine the minimum numeric value of x and y.

float fmodf(float x, float y)
Returns the floating-point remainder of x/y.

float hypotf(float x, float y)
Returns the square root of the sum of squares of x and y.

int ilogbf(float x)
Returns the unbiased integer exponent of x.

float tgammaf(float x)
Returns the gamma function of x.

double erfcinv(double x)
Returns the inverse complementary function of x.

double erfinv(double x)
Returns the inverse error function of x.

double j0(double x)
Returns the value of the Bessel function of the first kind of order 0 for x.

Unsupported on the device, missing.
float frexpf(float x, int* nptr)
Extract mantissa and exponent of x.

float modff(float x, float* iptr)
Break down x into fractional and integral parts.

float lgammaf(float x)
Returns the natural logarithm of the absolute value of the gamma function of x.

float nextafterf(float x, float y)
Returns next representable single-precision floating-point value after argument.

double remquo(double x, double y, int* quo)
Returns double-precision floating-point remainder and part of quotient.

Thanks @jujiang-del , I am indeed missing fdividef, but I'm not sure I follow the rest of the list, as they are already in there. Even the ones you listed as unsupported are in the page - and it would appear that they are supported on device as I was able to run them in kernels (it's how I got the ULP differences) on my local machine

adeljo-amd added ci:docs-only Only run Read the Docs CI on this PR documentation labels Feb 3, 2025

adeljo-amd requested a review from neon60 February 3, 2025 16:02

adeljo-amd self-assigned this Feb 3, 2025

adeljo-amd requested review from chrispaquot, gandryey, saleelk, mangupta and rakesroy as code owners February 3, 2025 16:02

adeljo-amd force-pushed the math_doc branch 2 times, most recently from a1238f8 to e9be8fc Compare February 3, 2025 16:07

adeljo-amd changed the title ~~Draft: Docs: Update Math API page~~ Docs: Update Math API page Feb 5, 2025

randyh62 reviewed Feb 5, 2025

View reviewed changes

docs/reference/math_api.rst Outdated Show resolved Hide resolved

randyh62 approved these changes Feb 5, 2025

View reviewed changes

docs/reference/math_api.rst Show resolved Hide resolved

docs/reference/math_api.rst Show resolved Hide resolved

docs/reference/math_api.rst Outdated Show resolved Hide resolved

neon60 force-pushed the docs/develop branch from c36483f to 8a8a110 Compare February 6, 2025 14:42

adeljo-amd force-pushed the math_doc branch from e9be8fc to 8922c17 Compare February 7, 2025 08:34

adeljo-amd force-pushed the math_doc branch from 8922c17 to ba4e2dd Compare February 7, 2025 09:09

adeljo-amd force-pushed the math_doc branch 3 times, most recently from 01057c5 to e164839 Compare February 19, 2025 12:57

adeljo-amd force-pushed the math_doc branch from e164839 to 1bdd504 Compare February 20, 2025 08:20

neon60 approved these changes Feb 20, 2025

View reviewed changes

adeljo-amd requested a review from lpaoletti February 20, 2025 11:32

adeljo-amd force-pushed the math_doc branch from 1bdd504 to 955be69 Compare February 21, 2025 08:58

jujiang-del reviewed Feb 25, 2025

View reviewed changes

adeljo-amd force-pushed the math_doc branch from 955be69 to 09a5a4d Compare February 26, 2025 08:56

adeljo-amd force-pushed the math_doc branch from 09a5a4d to 155b1d8 Compare March 6, 2025 12:51

Docs: Update math api page

516400f

adeljo-amd force-pushed the math_doc branch from 155b1d8 to 516400f Compare March 6, 2025 13:11

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Docs: Update Math API page #3738

Docs: Update Math API page #3738

adeljo-amd commented Feb 3, 2025

randyh62 left a comment

neon60 commented Feb 7, 2025

adeljo-amd commented Feb 7, 2025

adeljo-amd commented Feb 19, 2025

g-h-c commented Feb 20, 2025

adeljo-amd commented Feb 20, 2025

adeljo-amd commented Feb 21, 2025

jujiang-del Feb 25, 2025

adeljo-amd Feb 25, 2025

jujiang-del Mar 7, 2025

Docs: Update Math API page #3738

Are you sure you want to change the base?

Docs: Update Math API page #3738

Conversation

adeljo-amd commented Feb 3, 2025

randyh62 left a comment

Choose a reason for hiding this comment

neon60 commented Feb 7, 2025

adeljo-amd commented Feb 7, 2025

adeljo-amd commented Feb 19, 2025

g-h-c commented Feb 20, 2025

adeljo-amd commented Feb 20, 2025

adeljo-amd commented Feb 21, 2025

jujiang-del Feb 25, 2025

Choose a reason for hiding this comment

adeljo-amd Feb 25, 2025

Choose a reason for hiding this comment

jujiang-del Mar 7, 2025

Choose a reason for hiding this comment