A question about instantiating a Brevitas quantizer for an activation layer #875
Answered by Giuseppe5
RyougiKukoc asked this question in Q&A
I want to quantize an activation layer that is not implemented in Brevitas (such as `nn.GELU`). After taking a glance at `QuantReLU` and `QuantTanh`, I wonder if it is right to write it as follows:

```python
from brevitas.nn.quant_layer import QuantNonLinearActLayer
from brevitas.quant import Int8ActPerTensorFloat
from torch import nn

quant_gelu = QuantNonLinearActLayer(
    act_impl=nn.GELU,
    passthrough_act=True,
    input_quant=None,
    act_quant=Int8ActPerTensorFloat,
    return_quant_tensor=True,
)
```
Answered by Giuseppe5 on Feb 26, 2024
This works, and it would be basically equivalent to having `nn.GELU` followed by a `QuantIdentity(act_quant=Int8ActPerTensorFloat)`. The thing I would point out is that `passthrough_act` in this case should be False.
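For concreteness, here is a minimal sketch of what this suggestion looks like in code: the corrected module from the question next to the composed equivalent the answer describes. The `nn.Sequential` wrapper and the sample input are illustrative assumptions, not something from the thread.

```python
import torch
from torch import nn

from brevitas.nn import QuantIdentity
from brevitas.nn.quant_layer import QuantNonLinearActLayer
from brevitas.quant import Int8ActPerTensorFloat

# Corrected version of the module from the question
quant_gelu = QuantNonLinearActLayer(
    act_impl=nn.GELU,
    passthrough_act=False,  # per the answer, passthrough_act should be False here
    input_quant=None,
    act_quant=Int8ActPerTensorFloat,
    return_quant_tensor=True,
)

# The composition the answer calls basically equivalent:
# a float GELU followed by 8-bit requantization of its output
# (the Sequential wrapper is an illustrative assumption)
gelu_then_quant = nn.Sequential(
    nn.GELU(),
    QuantIdentity(act_quant=Int8ActPerTensorFloat, return_quant_tensor=True),
)

x = torch.randn(4, 16)
out_a = quant_gelu(x)       # QuantTensor: int8-quantized GELU output
out_b = gelu_then_quant(x)  # QuantTensor: same computation, composed explicitly
```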
Answer selected by RyougiKukoc