Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Examples] Finetune Falcon 7B and 40B Example #2242

Merged
merged 19 commits into from
Sep 11, 2023

Conversation

xzrderek
Copy link
Contributor

@xzrderek xzrderek commented Jul 14, 2023

I've added an example on how to fine-tune Falcon-7B-sharded (for those with less powerful GPUs), Falcon-7B, and Falcon-40B using SkyPilot and wrote a README.md (heavily borrowed from the Vicuna README).

Ideally, the repo referenced in my code, https://github.com/xzrderek/sky-falcon.git, should be a repo under skypilot-org, similar to the sky-llama example, where their code is stored here.

Copy link
Collaborator

@romilbhardwaj romilbhardwaj left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Awesome work @xzrderek! Left some comments. Will follow up offline.

llm/falcon/train.yaml Outdated Show resolved Hide resolved
llm/falcon/train.yaml Outdated Show resolved Hide resolved
llm/falcon/train.yaml Outdated Show resolved Hide resolved
llm/falcon/README.md Outdated Show resolved Hide resolved
llm/falcon/README.md Show resolved Hide resolved
llm/falcon/README.md Outdated Show resolved Hide resolved
@Michaelvll
Copy link
Collaborator

Note: one thing to look at in the future https://github.com/AdrianBZG/LLM-distributed-finetune

Copy link
Collaborator

@romilbhardwaj romilbhardwaj left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @xzrderek! Doing a run now. Left some comments.

llm/falcon/train.yaml Outdated Show resolved Hide resolved
llm/falcon/train.py Show resolved Hide resolved
llm/falcon/README.md Outdated Show resolved Hide resolved
llm/falcon/README.md Outdated Show resolved Hide resolved
llm/falcon/README.md Outdated Show resolved Hide resolved
@romilbhardwaj
Copy link
Collaborator

Training completed - got a well-behaved loss curve 🎉
image

Copy link
Collaborator

@romilbhardwaj romilbhardwaj left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @xzrderek! I pushed some changes to the readme, should be good to go now. Ran a training again, and it seems to work nicely:

image

Merging may be blocked on #2536.

@romilbhardwaj romilbhardwaj merged commit 9e115c9 into skypilot-org:master Sep 11, 2023
@Michaelvll
Copy link
Collaborator

This is awesome! We can have a serving yaml for falcon as well, especially the latest falcon-180B, so people can play with the largest model open-source LLM themselves. ; )

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants