Can you clarify whether the 8-bit GPTQ-quantized models use 8-bit or 16-bit activations? Model link: https://huggingface.co/Qwen/Qwen2.5-72B-Instruct-GPTQ-Int8

Replies: 1 comment
-
GPTQ is weight-only quantization. GPTQ-INT8 stores 8-bit weights plus fp16 scales and fp16 zero points, and activations stay in fp16.
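To make the dtype split concrete, here is a minimal sketch of what "8-bit weights, fp16 scales/zeros, fp16 activations" means numerically. It is not the actual GPTQ kernel or checkpoint layout; the tensor names, toy shapes, and simple per-output-channel scale/zero layout are illustrative assumptions (real GPTQ checkpoints use grouped, packed weights).

```python
import torch

def dequantize_int8(w_int8: torch.Tensor,
                    scale: torch.Tensor,
                    zero: torch.Tensor) -> torch.Tensor:
    # Recover fp16 weights: W_fp16 = (W_int8 - zero) * scale
    return (w_int8.to(torch.float16) - zero) * scale

# Toy per-output-channel parameters (16 output channels, 32 input features);
# grouping/packing used by real GPTQ checkpoints is omitted here.
w_int8 = torch.randint(-128, 128, (16, 32), dtype=torch.int8)  # stored 8-bit weights
scale = torch.full((16, 1), 0.01, dtype=torch.float16)         # fp16 scales
zero = torch.zeros(16, 1, dtype=torch.float16)                 # fp16 zero points

w_fp16 = dequantize_int8(w_int8, scale, zero)
x = torch.randn(4, 32, dtype=torch.float16)  # activations are never quantized (W8A16)

# At inference the kernel effectively runs an fp16 GEMM against the
# dequantized weights: y = x @ w_fp16.T
print(w_fp16.dtype, x.dtype)  # torch.float16 torch.float16
```

So the "INT8" in the model name refers only to how the weights are stored; the matrix multiplications themselves run against fp16 activations.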