Scanning theory #49

enesdoruk · 2024-12-11T11:47:01Z

Hi, what is the rationale for scanning only half of the channel features instead of all channels? The model splits the channels in two: one half passes through the scan, and the other half uses only a conv. What if we scanned all channels and recalibrated them with the pre-scan features?

ahatamiz · 2024-12-12T15:48:34Z

Hi @enesdoruk, the idea is to have the network learn a diverse set of features coming from both the SSM and non-SSM branches. The SSM branch encodes an implicit inductive bias for pixel dependency, in which the network does not have access to all of the tokens. The non-SSM branch, however, removes all such dependencies. This keeps the network from overfitting quickly (some features are easy guesses, for example the head or tail of a bird) and lets it learn more robust feature representations.

Hope this helps.
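To make the two-branch idea concrete, here is a minimal pure-Python sketch; it is not the repository's code. It assumes a toy linear recurrence (`causal_scan`) as a stand-in for the SSM scan and a fixed symmetric kernel (`local_conv`) as a stand-in for the conv branch. All function names, the decay value, and the kernel weights are hypothetical and chosen only for illustration.

```python
def causal_scan(seq, decay=0.5):
    """Toy stand-in for an SSM scan (assumption): h_t = decay * h_{t-1} + x_t.

    The output at position t depends only on tokens 0..t, which is the
    ordering-dependent inductive bias of the scanned branch.
    """
    out, h = [], 0.0
    for x in seq:
        h = decay * h + x
        out.append(h)
    return out


def local_conv(seq, kernel=(0.25, 0.5, 0.25)):
    """Symmetric 1D convolution with zero padding (assumed kernel).

    Each output sees its left and right neighbours equally, so there is
    no dependence on scan order.
    """
    pad = len(kernel) // 2
    padded = [0.0] * pad + list(seq) + [0.0] * pad
    return [
        sum(k * padded[t + i] for i, k in enumerate(kernel))
        for t in range(len(seq))
    ]


def split_mixer(channels):
    """Send the first half of the channels through the scan and the second
    half through the conv only, then concatenate along the channel axis.

    `channels` is a list of channels, each a list of token values.
    """
    half = len(channels) // 2
    scanned = [causal_scan(ch) for ch in channels[:half]]
    convolved = [local_conv(ch) for ch in channels[half:]]
    return scanned + convolved


if __name__ == "__main__":
    impulse = [0.0, 0.0, 1.0, 0.0]
    out = split_mixer([impulse, impulse])
    print(out[0])  # [0.0, 0.0, 1.0, 0.5]   scan: nothing before the impulse
    print(out[1])  # [0.0, 0.25, 0.5, 0.25] conv: spreads to both sides
```

Feeding the same impulse to both halves shows the difference: the scanned channel carries information only forward along the token order, while the conv channel responds symmetrically around the impulse. Concatenating the two gives later layers both kinds of features.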
