Allow GPTQModel to auto select Marlin or faster kernels for inference only ops #6446

build (3.9, ubuntu-20.04)

succeeded Jan 7, 2025 in 2m 31s