CUDA: faster q2_K, q3_K MMQ + int8 tensor cores #13224
