CUDA: faster q2_K, q3_K MMQ + int8 tensor cores #10553

Triggered via pull request June 14, 2024 15:51
Status Success
Total duration 14m 4s

python-lint.yml

on: pull_request