I've stumbled into this several times in the past. It can be really frustrating and confusing, but it could also be related to my setup, as I wasn't able to reproduce it anywhere but my own workstation. I still wanted to record it in case it is an issue for someone else.
Consider the simple snippet:
const target = if here.gpus.size > 0 then here.gpus[0] else here;
on target {
  var Arr: [1..1_000_000] int;
  @gpu.assertEligible
  foreach elem in Arr do
    elem += 1;
  writeln(+ reduce Arr);
}
When I compile and run this on an RTX A2000 with CUDA 12.4, the first run takes about 1 minute to finish. If I run it again soon thereafter, it is almost instantaneous. Whenever I run into this, I assume there is a bug, Ctrl+C the execution, and start debugging. FWIW, --debugGpu shows that we call into CUB helpers from the host for the final round of reduction and that the CUB functions seem to take forever to finish. This could be related to the NVIDIA Persistence Daemon issues we have seen in the past, which are documented in the GPU technote.
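For anyone trying to confirm where the time goes on their own machine, here is a minimal sketch that times the kernel and the reduction separately. It is not something I have run as part of this report. The `stopwatch` type from the standard `Time` module and the variable names `t` and `sum` are my additions; the rest is the snippet above. Note that the first timed region may also absorb one-time GPU initialization, so treat the split as a rough indication only.

```chapel
use Time;

const target = if here.gpus.size > 0 then here.gpus[0] else here;

on target {
  var Arr: [1..1_000_000] int;
  var t: stopwatch;  // hypothetical helper variable, not in the original snippet

  // Time the element-wise kernel on its own.
  t.start();
  @gpu.assertEligible
  foreach elem in Arr do
    elem += 1;
  t.stop();
  writeln("kernel: ", t.elapsed(), " s");

  // Time the reduction separately; this is the part that appears to stall.
  t.clear();
  t.start();
  const sum = + reduce Arr;
  t.stop();
  writeln("reduce: ", t.elapsed(), " s");

  writeln(sum);
}
```

If the slowdown matches what I described, the "reduce" line should account for nearly all of the first run's wall time and shrink to almost nothing on an immediate second run.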