-
@ydhongHIT I have a related idea. The plan is to have a ResNet backbone, either separate or part of the current one, that allows defining models where specific blocks use an MHA or MHA-like attention module in place of the 3x3 conv.
There are also some other proposed attention modules that would work in that manner...
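
A rough sketch of how per-block selection could be expressed, assuming a simple spec list that a backbone builder would consume. All names here (`BlockSpec`, `build_block_specs`, `attn_blocks`) are hypothetical and not part of any existing API; the actual design may look quite different.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BlockSpec:
    """Hypothetical description of one bottleneck block."""
    stage: int       # index of the ResNet stage the block belongs to
    index: int       # position of the block within its stage
    spatial_op: str  # "conv3x3" (default) or "attn" (MHA-like module)


def build_block_specs(depths, attn_blocks=frozenset()):
    """Return one BlockSpec per block.

    depths:      number of blocks in each stage, e.g. (3, 4, 6, 3)
    attn_blocks: set of (stage, index) pairs whose 3x3 conv should be
                 swapped for an attention module (an assumed convention)
    """
    specs = []
    for stage, depth in enumerate(depths):
        for index in range(depth):
            op = "attn" if (stage, index) in attn_blocks else "conv3x3"
            specs.append(BlockSpec(stage, index, op))
    return specs


# ResNet-50 style depths, with attention in every block of the last stage.
specs = build_block_specs(
    (3, 4, 6, 3),
    attn_blocks={(3, 0), (3, 1), (3, 2)},
)
```

A builder could then walk `specs` and instantiate either the usual 3x3 conv or an attention module for each block, which keeps the choice of attention variant independent of the backbone definition.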
-
Thanks for your great work. Do you plan to implement the models in this paper (https://arxiv.org/pdf/2101.11605.pdf)?