Replies: 1 comment
-
The paper for SOLAR 10.7B indirectly provides a formula for upscaling from 7B (or 7.2B) to 10.7B (or 11B). |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
How to create a model for fine-tuning through multiple passthrough merges?
I want to create 5b by merging qwen2.5 1.5b, can anyone guide me?
Beta Was this translation helpful? Give feedback.
All reactions