hip_cuda_examples

Different NVIDIA CUDA and AMD HIP implementations of matrix multiplication, vector add, reduce operations, and layernorm kernels. Each kernel also uses different data types like fp64, fp32, fp16(half), and half2.

Name		Name	Last commit message	Last commit date
Latest commit History 87 Commits
amd_matrix_core		amd_matrix_core
amd_wmma		amd_wmma
bkup		bkup
common		common
cuda		cuda
cuda_rt		cuda_rt
hip-python		hip-python
hip		hip
hip_rt		hip_rt
others		others
rocblas		rocblas
scripts		scripts
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

hip_cuda_examples

About

Releases

Packages

Languages

scxiao/hip_cuda_examples

Folders and files

Latest commit

History

Repository files navigation

hip_cuda_examples

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages