-
Notifications
You must be signed in to change notification settings - Fork 273
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fine tuning guide? #57
Comments
Thank you for the kind comments! Good point, we don't have a simple document that explains how to reproduce the Flash model, end-to-end. As a start, the README page found here (https://github.com/NovaSky-AI/SkyThought/tree/main/skythought/tools) describes all of the tools and scripts used to produce both Sky-T1-32B-Preview and Flash, and all necessary code is readily available in the directory. |
working towards tuning first step, basic mechanics of ollama releases fwiw, I created my own ollama release and uploaded to the site, just a shim around prior work including the hf.co link above https://ollama.com/niccolox/sky-t1-flash-cline blog post https://www.niccolox.com/p/355-making-an-ollama-model-release-with-sky-t1-flash-for-cline its really a shim, but I intend to swim upstream and next fine tune, than train a model THANKS! |
btw, this https://github.com/NovaSky-AI/SkyThought/tree/main/skythought/tools gives a 404 |
@niccolox Hi, thanks for sharing! We refactor our codebase in the latest PRs so the |
congratulations on doing this for world peace and the future of humanity
can you give some information, perhaps even a dedicated document on fine tuning Sky-T1-32B-Flash?
fine tuning is an important but often overlooked aspect to LLM use and I think would remove much of the WTF factor when implemented
The text was updated successfully, but these errors were encountered: