Skip to content

[NVIDIA GPU] Optimize collective matmul loops when contracting dim is sharded #3651

[NVIDIA GPU] Optimize collective matmul loops when contracting dim is sharded

[NVIDIA GPU] Optimize collective matmul loops when contracting dim is sharded #3651

no-rocm-only-targets-in-cpu-build

succeeded Oct 3, 2024 in 1m 27s