This project aims to implement and evaluate quantization techniques on machine learning models deployed to Jetson Nano edge devices.
Install dependencies: pip install -r requirements.txt
Create quantized model from scratch: make
Individual steps (symmetric quantization):
- Extract unquantized parameters:
make setup
- Calculate quantized parameters:
make quantize_params
- Create calibration prep model:
make prep_model
- Calculate quantized activations:
make quantize_activations
- Calculate quantized biases:
make quantize_biases
- Statically quantize model:
make quantize_model_symm
- Dynamically quantize model:
make quantize_model_dyn_symm
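For orientation, the symmetric scheme behind the steps above can be sketched in a few lines of plain Python. This is a minimal illustration, not the code the make targets run: the function names, the int8 range of [-127, 127], and the per-tensor max-abs scale are all assumptions.

```python
def quantize_symmetric(values, num_bits=8):
    """Map floats to signed integers with a single scale and a zero point fixed at 0.

    Hypothetical helper for illustration; not part of this repository.
    """
    qmax = 2 ** (num_bits - 1) - 1  # 127 for 8 bits
    max_abs = max(abs(v) for v in values)
    # Guard against an all-zero tensor, where any scale would do.
    scale = max_abs / qmax if max_abs > 0 else 1.0
    quantized = [max(-qmax, min(qmax, round(v / scale))) for v in values]
    return quantized, scale


def dequantize_symmetric(quantized, scale):
    """Recover approximate floats from the integer values."""
    return [q * scale for q in quantized]


weights = [-1.0, -0.4, 0.0, 0.3, 1.0]
q_weights, w_scale = quantize_symmetric(weights)
restored = dequantize_symmetric(q_weights, w_scale)
print(q_weights, w_scale)
```

Because the zero point is fixed at 0, a float 0.0 always maps to integer 0, at the cost of wasting part of the integer range when the values are not centered on zero.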
Individual steps (asymmetric quantization):
- Extract unquantized parameters:
make setup
- Calculate quantized parameters:
make quantize_params_asymm
- Create calibration prep model:
make prep_model_asymm
- Calculate quantized activations:
make quantize_activations_asymm
- Calculate quantized biases:
make quantize_biases_asymm
- Statically quantize model:
make quantize_model_asymm
- Dynamically quantize model:
make quantize_model_dyn_asymm
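The asymmetric variant adds a zero point so that the integer range covers [min, max] of the data rather than [-max_abs, max_abs]. The sketch below assumes an unsigned 8-bit range of [0, 255] and per-tensor min/max statistics; the function names are hypothetical and the repository's own targets may differ in detail.

```python
def quantize_asymmetric(values, num_bits=8):
    """Map floats to unsigned integers using a scale and a zero point.

    Hypothetical helper for illustration; not part of this repository.
    """
    qmin, qmax = 0, 2 ** num_bits - 1  # 0..255 for 8 bits
    # Extend the range to include 0.0 so that zero is exactly representable.
    lo = min(min(values), 0.0)
    hi = max(max(values), 0.0)
    scale = (hi - lo) / (qmax - qmin) if hi > lo else 1.0
    zero_point = max(qmin, min(qmax, round(qmin - lo / scale)))
    quantized = [max(qmin, min(qmax, round(v / scale) + zero_point)) for v in values]
    return quantized, scale, zero_point


def dequantize_asymmetric(quantized, scale, zero_point):
    """Recover approximate floats from the integer values."""
    return [(q - zero_point) * scale for q in quantized]


activations = [-1.0, 0.0, 0.5, 2.0, 3.0]
q_acts, act_scale, act_zero_point = quantize_asymmetric(activations)
restored_acts = dequantize_asymmetric(q_acts, act_scale, act_zero_point)
print(q_acts, act_scale, act_zero_point)
```

Compared with the symmetric scheme, this uses the full integer range for skewed data (for example, activations after a ReLU), but every integer operation has to account for the zero point.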
Validate quantized parameters: make validate
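What `make validate` checks is not described here. One plausible check, shown purely as an assumption, is a round trip: dequantize the stored integers and confirm that they stay within the integer range and within half a quantization step of the original floats. The helper names and the tolerance below are hypothetical.

```python
def roundtrip_error(original, quantized, scale, zero_point=0):
    """Largest absolute difference between the original and dequantized values."""
    restored = [(q - zero_point) * scale for q in quantized]
    return max(abs(o - r) for o, r in zip(original, restored))


def is_valid(original, quantized, scale, zero_point=0, qmin=-127, qmax=127):
    """Hypothetical validity check: integers in range and error at most scale / 2."""
    in_range = all(qmin <= q <= qmax for q in quantized)
    # Small epsilon absorbs floating-point rounding in the comparison.
    return in_range and roundtrip_error(original, quantized, scale, zero_point) <= scale / 2 + 1e-12


original = [-1.0, -0.4, 0.0, 0.3, 1.0]
scale = 1.0 / 127
quantized = [round(v / scale) for v in original]
print(is_valid(original, quantized, scale))
```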
Clear activations/, biases/, params/, and models/ directories: make clean