You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Memory access fault during execution of updateChecksum().
OFFLOAD ERROR: Memory access fault by GPU 8 (agent 0xa11a80) at virtual address 0xf15000. Reasons: Unknown (0)
Device pointer 0xf15000 does not point into any (current or prior) host-issued allocation (might be a global, stack, or shared location).
To Reproduce
Run HISTOGRAM-OMPTarget. I was also able to reproduce the same issue with MULTI_REDUCE-OMPTarget.
Expected behavior
There is no memory access fault.
Compilers & Libraries (please complete the following information):
Compiler & version: [e.g. GCC 4.9.3]: Clang 20
ROCm version (if applicable): 6.2.1
Additional context
The error occurs during the checksum computation in the calcChecksum method (DataUtils.hpp:387). The problem seems to be that 'dataSpace' (rajaperf::DataSpace::OmpTarget) and check_dataSpace (rajaperf::DataSpace::Host) mismatch. If the data spaces mismatch, it is falsely assumed that ptr (m_counts_final.data()) points to GPU memory. The following call of copyData() (DataUtils.hpp:397) triggers the memory error.
The text was updated successfully, but these errors were encountered:
ThorBl
added a commit
to ROCm/aomp
that referenced
this issue
Nov 13, 2024
Updated script to use the latest version of RAJAPerf. It contains a bug fix for the build error we currently observe. Another bug ticket is still open (LLNL/RAJAPerf#493).
!!! Requires cmake version 3.23 !!!!
Describe the bug
Memory access fault during execution of updateChecksum().
OFFLOAD ERROR: Memory access fault by GPU 8 (agent 0xa11a80) at virtual address 0xf15000. Reasons: Unknown (0)
Device pointer 0xf15000 does not point into any (current or prior) host-issued allocation (might be a global, stack, or shared location).
To Reproduce
Run HISTOGRAM-OMPTarget. I was also able to reproduce the same issue with MULTI_REDUCE-OMPTarget.
Expected behavior
There is no memory access fault.
Compilers & Libraries (please complete the following information):
Additional context
The error occurs during the checksum computation in the calcChecksum method (DataUtils.hpp:387). The problem seems to be that 'dataSpace' (rajaperf::DataSpace::OmpTarget) and check_dataSpace (rajaperf::DataSpace::Host) mismatch. If the data spaces mismatch, it is falsely assumed that ptr (m_counts_final.data()) points to GPU memory. The following call of copyData() (DataUtils.hpp:397) triggers the memory error.
The text was updated successfully, but these errors were encountered: