-
Notifications
You must be signed in to change notification settings - Fork 149
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
refactor: set config into weights for quantization feature support more easily #400
refactor: set config into weights for quantization feature support more easily #400
Conversation
Thanks for the PR @thincal! I noticed there are a couple of issues I'm getting when attempting to test this with AWQ and GPTQ quants. Do you know what's going on here?
Output:
Same error with |
@tgaddair thanks for the test, I have checked the field name from config.json is wrongly specified, already fixed now. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM! Verified it works with AWQ, GTPQ, and unquantized models.
What does this PR do?
Fixes #399
Before submitting
to it if that's the case.
Who can review?
Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.
@tgaddair, thanks.