Pull requests: huggingface/optimum-amd
Fix (brevitas): Bumped Brevitas version to 'brevitas>=0.11'
#166
opened Nov 5, 2024 by nickfraser
Fix (example/brevitas): GPTQ should be applied before calibration.
#165
opened Nov 5, 2024 by nickfraser
Fix (example/brevitas): Allow setting more device options (e.g., 'cuda:1')
#164
opened Nov 5, 2024 by nickfraser
Adding a script to test accuracy on IPU provider after quantization with Quark
#148
opened Jul 17, 2024 by hanlin0628
Fix (brevitas/bias_correction): Add zero-valued bias to linear layers when accelerate is enabled.
#139
opened May 30, 2024 by nickfraser
Add a script for timm models quantization and evaluation
#138
opened May 30, 2024 by ChaoLi-AMD
Fix perplexity computation, MQA/GQA models & models requiring position_ids
#129
opened Apr 10, 2024 by fxmarty