We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
We support a limited subset of kernels with HIP devices compared to CUDA.
Every algorithm that is CUDA accelerated should also be HIP accelerated.
❯ ag type=CUDA src -l ─╯ src/ztrsm_LUT.jdf src/ztrsm_LLT.jdf src/zgemm_NN.jdf src/ztrsm_LUN.jdf src/zpotrf_U.jdf src/ztrsm_LLN.jdf src/zpoinv_L.jdf src/zgemm_TN_summa.jdf src/zgemm_NT.jdf src/zgemm_NN_summa.jdf src/zgemm_TN.jdf src/ztrsm_RLN.jdf src/ztrsm_RUN.jdf src/zgemm_NN_gpu.jdf src/zgemm_TT_summa.jdf src/zgemm_NT_summa.jdf src/zgemm_TT.jdf src/zpoinv_U.jdf src/ztrsm_RLT.jdf src/ztrsm_RUT.jdf src/zgetrf_nopiv.jdf src/zpotrf_L.jdf src/zgeqrf.jdf ~/parsec/dplasma feature/hip *1 ?6 ······················································································ bouteill@methane 14:48:41 ─╮ ❯ ag type=HIP src -l ─╯ src/zpotrf_U.jdf src/zgemm_TN_summa.jdf src/zgemm_NN_summa.jdf src/zgemm_NN_gpu.jdf src/zgemm_TT_summa.jdf src/zgemm_NT_summa.jdf src/zpotrf_L.jdf
The text was updated successfully, but these errors were encountered:
In particular TRSM is used in check_solution etc, and that can be pretty slow on large matrices without it.
Sorry, something went wrong.
TRSM: #122
abouteiller
No branches or pull requests
Description
We support a limited subset of kernels with HIP devices compared to CUDA.
Describe the solution you'd like
Every algorithm that is CUDA accelerated should also be HIP accelerated.
Additional context
The text was updated successfully, but these errors were encountered: