BN after Activation, Have you tried it? #598

ademyanchuk · 2021-04-29T11:00:42Z

ademyanchuk
Apr 29, 2021

Hello, Ross,

Thanks for the great work you are doing! As you have such an impressive experience with training so many models with so many more hyperparameters, I am curious, have you ever tried to change only one thing (for models there it is appropriate, e.g. resnet), to put BatchNorm layer after activation. If yes, what was your results?

Thank you in advance and all the best :)

Alexey

rwightman · 2021-04-29T15:47:32Z

rwightman
Apr 29, 2021
Maintainer

@ademyanchuk ResNetV2 nets are 'pre-act' and so are NfNets (but they don't have normalization layers). I've trained NFNets well, haven't really tried training the ResNetV2 but the weights in that file are pretty interesting as they were trained by google researchers investigating transfer learning and in21k pretraining.

https://github.com/rwightman/pytorch-image-models/blob/master/timm/models/resnetv2.py
https://github.com/rwightman/pytorch-image-models/blob/master/timm/models/nfnet.py

1 reply

ademyanchuk May 4, 2021
Author

Thank you :)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BN after Activation, Have you tried it? #598

{{title}}

Replies: 1 comment 1 reply

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

Select a reply

BN after Activation, Have you tried it? #598

ademyanchuk Apr 29, 2021

Replies: 1 comment · 1 reply

rwightman Apr 29, 2021 Maintainer

ademyanchuk May 4, 2021 Author

ademyanchuk
Apr 29, 2021

Replies: 1 comment 1 reply

rwightman
Apr 29, 2021
Maintainer

ademyanchuk May 4, 2021
Author