Releases: sophgo/sophgo-mq
Releases · sophgo/sophgo-mq
v1.0.1
Sophgo-mq v1.0.0 release
Sophgo-mq is an open-source model quantization toolkit forked form MQBench. Sophgo-mq has made some customized modifications and added some features based on MQBench.
Sophgo-mq is to provide:
- Quantitative Scheme. Fake quantization nodes are inserted into the PyTorch model, and users can customize the insertion points of the nodes as well as the behavior of the fake quantization nodes. After the model training is completed, the corresponding Onnx file as well as calitable and qtable will be exported.
- Useful Algorithms. Sophgo-mq supports a variety of quantization algorithms, including quantization algorithms for the CV and NLP fields.
- Mixed Precision. Sophgo-mq supports mixed-precision quantization, which can better ensure the performance of the model after quantization.
int8_qat_for_tpu-mlir
This release is matched with tpu - mlir'release, mainly implements int8 qat function, see usage manual in detail: https://tpumlir.org/docs/developer_manual/index.html, also can see mqbench official manual: https://mqbench.readthedocs.io/en/latest/