Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Small paper ideas to be added #262

Open
RyanKim17920 opened this issue Jul 25, 2024 · 6 comments
Open

Small paper ideas to be added #262

RyanKim17920 opened this issue Jul 25, 2024 · 6 comments

Comments

@RyanKim17920
Copy link

Here's some papers I've read that would be nice to have, I'll try to implement them if I can:

https://arxiv.org/pdf/2010.04245

https://arxiv.org/abs/2210.05144
(Probably should add FFN MoE as well)

https://arxiv.org/pdf/2404.02258
(Probably will be hard to make work with other features)

@lucidrains
Copy link
Owner

lucidrains commented Jul 26, 2024

@RyanKim17920 so the first paper is already in the repository and even cited

i do like the second paper, and can try it out before adding it

the third paper, i like as well, but may be outside the scope of this repo

@lucidrains
Copy link
Owner

@RyanKim17920 someone also shared with me https://arxiv.org/abs/2312.07987 which could be an improvement from MoA

@lucidrains
Copy link
Owner

@RyanKim17920 the switchhead paper is pretty good

will run the experiments tomorrow morning, and if all goes well, it will probably in the repository by week's end

@Baran-phys
Copy link

Baran-phys commented Oct 13, 2024

@lucidrains What do you think of https://www.arxiv.org/abs/2408.14915, in particular the DRA activation function for Continuous Transformers?

@Baran-phys
Copy link

@lucidrains If you confirm, I can also open a PR for DRA.

@lucidrains
Copy link
Owner

@Baran-phys hey Baran, thanks for sharing your paper.

it is interesting but i will probably not accept as it is not relevant for this repository. periodic activation functions is something i've been meaning to look into once the right problem presents

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants