Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
chilo-ms authored Oct 23, 2024
1 parent f56a16c commit 7d2ece0
Showing 1 changed file with 2 additions and 0 deletions.
2 changes: 2 additions & 0 deletions quantization/image_classification/trt/resnet50/README.md
Original file line number Diff line number Diff line change
@@ -1,6 +1,8 @@
# ONNX PTQ overview
Following is the end-to-end example using ORT quantization tool to quantize ONNX model, specifially image classification model, and run/evaluate the quantized model with TRT EP.

## Note
Implicit quantization is deprecated in TRT 10.1 (using calibration table in TRT EP to set `setDynamicRange`), we suggest to use explicit quantization aka QDQ format.
## Environment setup
### dataset
First, prepare the dataset for calibration. TensorRT recommends calibration data size to be at least 500 for CNN and ViT models.
Expand Down

0 comments on commit 7d2ece0

Please sign in to comment.