You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Let's implement and maintain Q4_0 and Q8_0 data type weight quantization tool to make test and example model.
It is temporary tool, and not for compiler module implementation.
Why
To help onert's LLM support feature development, we need tool to generate weight block quantization tool from fp32 circle test model.
It will also help PoC for circle schema update to support LLM model.
Type
Q4_0
Q8_0
Target operand
Gather's params
FullyConnected's weight
The text was updated successfully, but these errors were encountered:
What
Let's implement and maintain Q4_0 and Q8_0 data type weight quantization tool to make test and example model.
It is temporary tool, and not for compiler module implementation.
Why
To help onert's LLM support feature development, we need tool to generate weight block quantization tool from fp32 circle test model.
It will also help PoC for circle schema update to support LLM model.
The text was updated successfully, but these errors were encountered: