Skip to content

Releases: sophgo/sophgo-mq

v1.0.1

06 Jun 03:47
Compare
Choose a tag to compare

update torch to 2.3.0

Sophgo-mq v1.0.0 release

22 Mar 11:10
Compare
Choose a tag to compare

Sophgo-mq is an open-source model quantization toolkit forked form MQBench. Sophgo-mq has made some customized modifications and added some features based on MQBench.

Sophgo-mq is to provide:

  • Quantitative Scheme. Fake quantization nodes are inserted into the PyTorch model, and users can customize the insertion points of the nodes as well as the behavior of the fake quantization nodes. After the model training is completed, the corresponding Onnx file as well as calitable and qtable will be exported.
  • Useful Algorithms. Sophgo-mq supports a variety of quantization algorithms, including quantization algorithms for the CV and NLP fields.
  • Mixed Precision. Sophgo-mq supports mixed-precision quantization, which can better ensure the performance of the model after quantization.

int8_qat_for_tpu-mlir

09 Dec 04:16
Compare
Choose a tag to compare
int8_qat_for_tpu-mlir Pre-release
Pre-release

This release is matched with tpu - mlir'release, mainly implements int8 qat function, see usage manual in detail: https://tpumlir.org/docs/developer_manual/index.html, also can see mqbench official manual: https://mqbench.readthedocs.io/en/latest/