Hi,
to support CNN models, I modified the GPTQ code as follows (a rough sketch of both changes is included after the list):
1. support for grouped convolutions;
2. symmetric quantization without a zero-point parameter.
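Roughly, the changes look like this. This is a simplified round-to-nearest sketch of the quantizer and the per-group weight handling only, not the full GPTQ error-compensating update, and `quantize_sym` / `fake_quant_grouped_conv` are just illustrative names, not functions from the GPTQ repo:

```python
import torch
import torch.nn as nn

def quantize_sym(w, bits=4):
    # Symmetric uniform quantization with no zero point: the grid is
    # centered at 0 and the scale is chosen per output channel (per row).
    qmax = 2 ** (bits - 1) - 1                        # 7 for 4-bit
    scale = w.abs().amax(dim=1, keepdim=True) / qmax
    scale = scale.clamp(min=1e-8)                     # guard against all-zero rows
    q = torch.clamp(torch.round(w / scale), -qmax, qmax)
    return q * scale                                  # dequantized ("fake-quant") weights

def fake_quant_grouped_conv(conv: nn.Conv2d, bits=4):
    # Quantize each group's weight slice on its own, since each group
    # only sees its own subset of the input channels.
    w = conv.weight.data                              # [out_c, in_c / groups, kh, kw]
    per_group = w.shape[0] // conv.groups
    chunks = []
    for g in range(conv.groups):
        wg = w[g * per_group:(g + 1) * per_group]     # this group's filters
        wg2d = wg.reshape(per_group, -1)              # rows = output channels
        chunks.append(quantize_sym(wg2d, bits).reshape(wg.shape))
    conv.weight.data = torch.cat(chunks, dim=0)
    return conv
```

In my actual modification the rounding decision is driven by GPTQ's Hessian-based update instead of plain round-to-nearest; the sketch only shows the symmetric grid and the per-group split. Note that for the depthwise layers in mobilenetv2, per_group is 1, so this reduces to one scale per kernel.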
However, the accuracy is noticeably worse on the mobilenetv2/mnasnet1_0 models when quantizing to 4 bits.
Here are my results (top-1 accuracy in %; the value in parentheses is the ratio to FP32):

| model | FP32 | GPTQ W4 (sym) |
| --- | --- | --- |
| mobilenetv2 | 71.88 | 60.84 (84.64%) |
| mnasnet1_0 | 73.47 | 64.71 (88.08%) |
I only saw resnet18/resnet50 quantization results in your paper. Have you tested GPTQ on mobilenetv2 or mnasnet1_0?
Looking forward to your reply...