-
Notifications
You must be signed in to change notification settings - Fork 394
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Edit mixed precision pages in feature guide. #3755
base: develop
Are you sure you want to change the base?
Edit mixed precision pages in feature guide. #3755
Conversation
Signed-off-by: Dave Welsch <[email protected]>
Prerequisites | ||
------------- | ||
|
||
Manual mixed precision is supported only on PyTorch models. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is this true? I assumed it is because I didn't see APIs for TF or ONNX.
* Change the precision of all the layers in the model of a certain type | ||
* Change the precision of model input tensors (or only a subset of input tensors) | ||
* Change the precision of model output tensors (or only a subset of output tensors) | ||
* A leaf layer |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
How is a leaf layer defined?
Not supported. | ||
|
||
|
||
The ``apply`` call generates a report detailing how the request was inferred, propagated to other layers, and eventually realized. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Where is the report saved?
Layer Groups are defined as a group of layers grouped together based on certain rules. | ||
This helps in reducing search space over which the mixed precision algorithm operates. | ||
It also ensures that we search only over the valid bit-width settings for parameters and activations. | ||
Layer Groups are defined based on certain rules. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
What are the rules and how are they defined?
|
||
.. image:: ../../images/pareto.png | ||
:width: 900px | ||
An example of a Pareto list: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
How is this list different from the accuracy list in the previous phase?
|
||
Use Cases | ||
========= | ||
Conversion operations (convert ops) are introduced in the mixed-precision model for transition between ops with different activation bit widths or data types (float vs int). Convert ops contribute to the inference time along with bit-operations of ops. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I assume that 'convert op' is jargon for 'conversion operation'. Is that the case?
No description provided.