You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Is your feature request related to a problem? Please describe.
Need to zero-out a large array of GPU memory ("fast" way to zero out device memory).
Describe the solution you'd like
Want to allocate a large array and zero it out. We could call malloc_zero_out_kernel(nbytes); or something like that instead of having to write our own kernel to zero it out. Have a built-in umpire function to do that.
Describe alternatives you've considered
Using the resource manager to do a memset takes too long. Allocating an array and then launching a kernel to zero out memory could work, but that adds more code.
Is your feature request related to a problem? Please describe.
Need to zero-out a large array of GPU memory ("fast" way to zero out device memory).
Describe the solution you'd like
Want to allocate a large array and zero it out. We could call
malloc_zero_out_kernel(nbytes);
or something like that instead of having to write our own kernel to zero it out. Have a built-in umpire function to do that.Describe alternatives you've considered
Using the resource manager to do a memset takes too long. Allocating an array and then launching a kernel to zero out memory could work, but that adds more code.
Additional context
See teams conversation here.
The text was updated successfully, but these errors were encountered: