Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Scanning theory #49

Open
enesdoruk opened this issue Dec 11, 2024 · 1 comment
Open

Scanning theory #49

enesdoruk opened this issue Dec 11, 2024 · 1 comment

Comments

@enesdoruk
Copy link

Hi, What is the background of scanning half of channel features instead of all channels? The model splits channels by 2, and half of the channels pass through scanning, and the other half use just conv. What if we scan all channels and recalibrate with before scanning features?

@ahatamiz
Copy link
Collaborator

Hi @enesdoruk the idea is to have the network learn diverse set of features coming from both SSM and non-SSM branches. The SSM branch encodes an implicit inductive bias for pixel dependency where the network does not have access to the entire tokens. However the non-SSM branch removes all such dependencies. This allows the network to not quickly overfit (e.g. some features are easy guesses for example the head or tail of a bird) and learn more robust feature representations.

Hope it helped

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants