-
-
Notifications
You must be signed in to change notification settings - Fork 424
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Small paper ideas to be added #262
Comments
@RyanKim17920 so the first paper is already in the repository and even cited i do like the second paper, and can try it out before adding it the third paper, i like as well, but may be outside the scope of this repo |
@RyanKim17920 someone also shared with me https://arxiv.org/abs/2312.07987 which could be an improvement from MoA |
@RyanKim17920 the switchhead paper is pretty good will run the experiments tomorrow morning, and if all goes well, it will probably in the repository by week's end |
@lucidrains What do you think of https://www.arxiv.org/abs/2408.14915, in particular the DRA activation function for Continuous Transformers? |
@lucidrains If you confirm, I can also open a PR for DRA. |
@Baran-phys hey Baran, thanks for sharing your paper. it is interesting but i will probably not accept as it is not relevant for this repository. periodic activation functions is something i've been meaning to look into once the right problem presents |
Here's some papers I've read that would be nice to have, I'll try to implement them if I can:
https://arxiv.org/pdf/2010.04245
https://arxiv.org/abs/2210.05144
(Probably should add FFN MoE as well)
https://arxiv.org/pdf/2404.02258
(Probably will be hard to make work with other features)
The text was updated successfully, but these errors were encountered: