-
Notifications
You must be signed in to change notification settings - Fork 199
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Feat (ptq): for minifloat benchmark (#712)
- Loading branch information
Showing
4 changed files
with
307 additions
and
147 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
26 changes: 25 additions & 1 deletion
26
src/brevitas_examples/imagenet_classification/ptq/benchmark/single_command.sh
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1 +1,25 @@ | ||
python ptq_benchmark_torchvision.py $1 --calibration-dir /scratch/datasets/imagenet_symlink/calibration --validation-dir /scratch/datasets/imagenet_symlink/val --options-to-exclude graph_eq_merge_bias graph_eq_iterations | ||
python ptq_benchmark_torchvision.py $1 --calibration-dir /scratch/datasets/imagenet_symlink/calibration --validation-dir /scratch/datasets/imagenet_symlink/val \ | ||
--quant_format float \ | ||
--scale_factor_type float_scale \ | ||
--weight_bit_width 2 3 4 5 6 7 8 \ | ||
--act_bit_width 2 3 4 5 6 7 8 \ | ||
--weight_mantissa_bit_width 1 2 3 4 5 6 \ | ||
--weight_exponent_bit_width 1 2 3 4 5 6 \ | ||
--act_mantissa_bit_width 1 2 3 4 5 6 \ | ||
--act_exponent_bit_width 1 2 3 4 5 6 \ | ||
--bias_bit_width None \ | ||
--weight_quant_granularity per_channel per_tensor \ | ||
--act_quant_type sym \ | ||
--weight_param_method stats \ | ||
--act_param_method mse \ | ||
--bias_corr True \ | ||
--graph_eq_iterations 20 \ | ||
--graph_eq_merge_bias True \ | ||
--act_equalization layerwise \ | ||
--learned_round False \ | ||
--gptq False \ | ||
--gptq_act_order False \ | ||
--gpfq False \ | ||
--gpfq_p None \ | ||
--uint_sym_act_for_unsigned_values False \ | ||
--act_quant_percentile None \ |
Oops, something went wrong.