Skip to content

CUDA: faster q2_K, q3_K MMQ + int8 tensor cores #4425

CUDA: faster q2_K, q3_K MMQ + int8 tensor cores

CUDA: faster q2_K, q3_K MMQ + int8 tensor cores #4425