Replies: 1 comment 1 reply
-
@AlexKoff88, @alexsu52, anyone can help from your end |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hello,
I am currently working on a project to perform inference with anomaly detection (AD) models on an embedded system and am investigating ways to perform quantization.
Until now I trained a PaDiM model on a custom dataset, which performes very well.
I have already worked through the OpenVINO NNCF notebook but realized that the insertion of "Fake Quantization Layers" does not suit my project's needs.
I also attempted to quantize the trained PaDiM model, which I converted to TensorFlow, using the TensorFlow Lite converter, but this approach also seems to be a dead end.
Is there a way to perform "true" quantization to int8 with nncf?
Or could someone point me in the right direction?
Beta Was this translation helpful? Give feedback.
All reactions