[Examples] Finetune Falcon 7B and 40B Example #2242

xzrderek · 2023-07-14T22:24:32Z

I've added an example showing how to fine-tune Falcon-7B-sharded (for those with less powerful GPUs), Falcon-7B, and Falcon-40B using SkyPilot, along with a README.md (heavily borrowed from the Vicuna README).

Ideally, the repo referenced in my code, https://github.com/xzrderek/sky-falcon.git, should live under skypilot-org, similar to how the code for the sky-llama example is hosted.

romilbhardwaj

Awesome work @xzrderek! Left some comments. Will follow up offline.

llm/falcon/train.yaml

llm/falcon/README.md

Michaelvll · 2023-07-17T00:16:12Z

Note: one thing to look at in the future: https://github.com/AdrianBZG/LLM-distributed-finetune

romilbhardwaj

Thanks @xzrderek! Doing a run now. Left some comments.

llm/falcon/train.yaml

llm/falcon/train.py

llm/falcon/README.md

romilbhardwaj · 2023-07-25T22:00:13Z

Training completed - got a well-behaved loss curve 🎉

romilbhardwaj

Thanks @xzrderek! I pushed some changes to the README; it should be good to go now. I ran training again, and it seems to work nicely.

Merging may be blocked on #2536.

Michaelvll · 2023-09-11T21:05:26Z

This is awesome! We can add a serving YAML for Falcon as well, especially for the latest Falcon-180B, so people can play with the largest open-source LLM themselves. ;)

Adding Falcon Example

f7fe44d

romilbhardwaj reviewed Jul 17, 2023

xzrderek and others added 7 commits July 18, 2023 21:32

Update llm/falcon/README.md

496f273

Co-authored-by: Romil Bhardwaj <[email protected]>

Training Script

0c79d74

New YAML Script

da4b16b

Adding time and cost

12db085

Adding Image

c983dd0

Small change to README

464822d

Update README.md

98acc64

romilbhardwaj reviewed Jul 25, 2023

llm/falcon/train.yaml (outdated, resolved)

llm/falcon/train.py (resolved)

llm/falcon/README.md (outdated, resolved)

llm/falcon/README.md (outdated, resolved)

llm/falcon/README.md (outdated, resolved)

Small Updates

c3cffed

romilbhardwaj mentioned this pull request Jul 26, 2023

[UX] spot launch shows the cost of the controller, not the VM #2312

Closed

xzrderek and others added 9 commits July 26, 2023 17:22

Small changes to pricing

cc7fd05

wip

996ac47

wip

1adc069

Merge branch 'derek-falcon-example' of https://github.com/xzrderek/sk…

502871d

…ypilot into derek-falcon-example # Conflicts: # llm/falcon/README.md

add gpt3

51ec0eb

edits

e74d9cd

lint

8771ed6

Merge branch 'master' of github.com:skypilot-org/skypilot into derek-…

ed30758

…falcon-example

lint

d7c5204

romilbhardwaj approved these changes Sep 9, 2023

updates

f5a1f71

romilbhardwaj merged commit 9e115c9 into skypilot-org:master Sep 11, 2023

romilbhardwaj mentioned this pull request Sep 28, 2023

[Example] Falcon finetuning example #2162

Closed
