Skip to content

CUDA: faster q2_K, q3_K MMQ + int8 tensor cores #1338

CUDA: faster q2_K, q3_K MMQ + int8 tensor cores

CUDA: faster q2_K, q3_K MMQ + int8 tensor cores #1338