Remove type restrictions on normalization layers' scale and bias #2099

Open
wants to merge 2 commits into master

Conversation

@darsnack (Member) commented Nov 3, 2022

Right now, the normalization scale and bias are required to be the same type. This is an unnecessary restriction that isn't there for the weights and biases of other layers.
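To illustrate the difference (the struct and field names below are hypothetical stand-ins, not Flux's actual layer definitions): a single type parameter forces both fields onto one array type, while independent parameters let them differ.

```julia
# Hypothetical minimal illustration, not Flux's actual normalization structs.
struct AffineStrict{V}
    β::V  # bias
    γ::V  # scale -- forced to the same array type as β
end

struct AffineLoose{U,V}
    β::V  # bias
    γ::U  # scale -- may be a different array type, as in this PR
end

AffineLoose(zeros(Float64, 3), ones(Float32, 3))    # constructs fine
# AffineStrict(zeros(Float64, 3), ones(Float32, 3)) # MethodError
```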

PR Checklist

  • Tests are added
  • Entry in NEWS.md
  • Documentation, if applicable

Comment on lines 327 to 328:

      β::V # bias
    - γ::V # scale
    + γ::U # scale

Member

I don't know where the original type param names came from, but would it make sense to harmonize them with the actual parameter names? e.g. B,S instead of U,V.
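For concreteness, the suggested renaming might look like this; only the two affine fields are shown, and the struct name is a placeholder:

```julia
struct NormAffine{B,S}   # placeholder; the real layers carry many more fields
    β::B  # bias
    γ::S  # scale
end
```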

@mcabbott (Member) commented Nov 6, 2022

Should we consider wrapping these two in a Scale sub-layer, replaced with identity when absent? Perhaps that would also allow removing the affine field.
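Roughly like the sketch below, perhaps; this is an illustration under the assumption that Flux.Scale can stand in for the affine transform, with a placeholder struct and most fields omitted, not a concrete proposal:

```julia
using Flux

# Placeholder sketch: keep the affine transform as a sub-layer, with
# `identity` standing in when affine=false, so the separate γ/β fields
# and the `affine` flag could go away.
struct SubLayerNorm{A}
    affine::A   # Flux.Scale(chs) or identity
    chs::Int
    # running statistics, momentum, ε, etc. omitted
end

SubLayerNorm(chs::Integer; affine::Bool = true) =
    SubLayerNorm(affine ? Flux.Scale(chs) : identity, chs)
```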

@ToucheSir (Member) commented Nov 6, 2022

It's complicated because backends like cuDNN fuse in the affine transform as well. One idea would be to extract the scale and bias params from the sub-layer in the BN/IN/GN forward pass instead of calling the sub-layer itself, but that feels a tad hacky.
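For what that workaround could look like, here is a rough sketch with hypothetical helper names and a naive CPU fallback standing in for the fused cuDNN path:

```julia
using Flux
using Statistics: mean, var

# Hypothetical helpers: pull γ/β out of the affine sub-layer so a fused
# backend could receive them directly, instead of calling the sub-layer
# on the normalized output.
_affine_params(s::Flux.Scale) = (s.scale, s.bias)
_affine_params(::typeof(identity)) = (nothing, nothing)

# Naive fallback for a (features × batch) matrix; a GPU path would hand
# γ and β to the fused kernel here instead of broadcasting them.
function _norm_forward(affine, x::AbstractMatrix; ε = 1f-5)
    γ, β = _affine_params(affine)
    μ  = mean(x; dims = 2)
    σ² = var(x; dims = 2, corrected = false)
    x̂  = (x .- μ) ./ sqrt.(σ² .+ ε)
    γ === nothing ? x̂ : γ .* x̂ .+ β
end
```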

@ToucheSir (Member) left a comment

LGTM!

3 participants