-
-
Notifications
You must be signed in to change notification settings - Fork 1.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[WIP] add support for mixtral #145
base: main
Are you sure you want to change the base?
Conversation
709d2e3
to
d54e095
Compare
Fantastic and fabulous work @tohrnii!!! Super appreciate it! I will take a look later today! |
e2d6b62
to
ff31b00
Compare
Any update about this pull request? |
@kaykyr @danielhanchen @tohrnii You guys open to some collaboration on this? I think I just my Phi2 implementation done (big touch wood) so I'm happy to take a look |
Apologies, I got stuck on something else. I'd love to collaborate @cm2435. If however you are close to completing the implementation, I'm happy to close this PR in favor of yours. |
For sure! I am trying to fine tune a MoE pretrained if I had progress I will create pull requests guys. I also able to offer my small server (2x RTX 3090 with NVLink + i9 11900HK + 64GB DDR4) for collaborators who wanna run tests with multi-gpu. |
@kaykyr Oh thanks for the kind offer!!! I'll take up for that offer later in the month :) |
@kaykyr funny you mention that- I've got almost the exact same setup! I'm going to be very sad when they deprecate the SLI bridge as cuda supported hardware |
Great work. Is there any estimate about when this will be merged? |
Mixtral WIP