I'm looking at the MobileNetV1 example and I see that `scaling_per_output_channel` is `True` in `QuantReLU` after the first layer (`init_block`) and then after each pointwise convolutional layer except for the last stage.
On the other hand, in ProxylessNAS Mobile14, `scaling_per_output_channel` is `False` after the first layer and then `True` after the first 1x1 convolutional layer in each `ProxylessBlock`.
So what's the purpose of `scaling_per_output_channel`? Thank you.
Similar to what happens for weight scaling, you can have a single scale factor for the entire tensor being quantized, or one per channel of that tensor. Other slicings of the tensor for computing scale factors are also possible, although arguably less common (e.g., per-row, per-group, etc.).
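To illustrate (a minimal NumPy sketch of max-based scale computation, not Brevitas's actual internals), the two granularities simply reduce over different axes of the tensor when computing the scale:

```python
import numpy as np

# Hypothetical activation tensor: (channels, height, width)
x = np.array([[[0.1, 0.2], [0.3, 0.4]],
              [[1.0, 2.0], [3.0, 4.0]]])

bits = 8
qmax = 2 ** bits - 1  # unsigned range, e.g. after a ReLU

# Per-tensor: one scale for the whole tensor
scale_per_tensor = x.max() / qmax

# Per-channel: one scale per output channel (reduce over H, W only)
scale_per_channel = x.max(axis=(1, 2)) / qmax

print(scale_per_tensor)        # a single scalar
print(scale_per_channel.shape) # one scale per channel
```

With a per-tensor scale, the channel with the small range (max 0.4) is forced to share the scale dictated by the large-range channel; per-channel scaling gives each channel its own step size.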
The choice of per-tensor vs. per-channel scaling depends on the network topology, the hardware constraints of the device where you plan to execute your network, and other factors.
As a rule of thumb, the finer the granularity of your scale factors, the better the final accuracy of the quantized network. At the same time, the computational cost and memory usage of your network increase, since scale factors are stored in high precision.
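A quick sketch of that accuracy trade-off (assuming simple max-based unsigned quantization for illustration, not Brevitas's actual scaling implementation): when one channel has a much larger range than the others, a single per-tensor scale wastes precision on the small-range channels, and the per-channel variant reconstructs the tensor more accurately:

```python
import numpy as np

rng = np.random.default_rng(0)
# Channels with very different ranges favour per-channel scaling
x = np.abs(rng.normal(size=(4, 64)))
x[0] *= 100.0  # one outlier channel dominates the per-tensor scale

qmax = 255  # 8-bit unsigned range

def fake_quantize(x, scale):
    # Quantize to integers, then dequantize to measure the error
    q = np.clip(np.round(x / scale), 0, qmax)
    return q * scale

err_per_tensor = np.abs(fake_quantize(x, x.max() / qmax) - x).mean()
err_per_channel = np.abs(
    fake_quantize(x, x.max(axis=1, keepdims=True) / qmax) - x
).mean()

# Finer granularity -> lower reconstruction error,
# at the cost of storing one high-precision scale per channel.
assert err_per_channel < err_per_tensor
```

The outlier channel is quantized identically in both cases (its per-channel scale equals the per-tensor one), but the remaining channels get much smaller step sizes under per-channel scaling, which is where the accuracy gain comes from.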