You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
So it that true that the NTuple type in gpu kernel are stored in registers? Or could you please tell me how to allocate memory on registers?
Generally, yes. In Julia, tuple-valued objects are emitted as LLVM IR that keeps the data stored in registers. It's possible that ptxas, the back-end compiler, will spill these to local memory (which is just device memory, i.e., slow), but that's rare.
Do note that there's other limits though, e.g. #99, where compiler heuristics prevent creation of large tuples (as they're generally pretty expensive, causing large IR, and for CPU execution are better moved to the heap).
Hello, I have some questions about usage of registers.
In
GemmKernels.jl/kernel.jl
file, functionmatmul_singlestage
, I foundwhich seems simply allocate some memory on the registers, but I found that the
LocalArray
is defined bywhere the data are stored as
NTuple
.So it that true that the NTuple type in gpu kernel are stored in registers? Or could you please tell me how to allocate memory on registers?
The text was updated successfully, but these errors were encountered: