Add llama-2 template support for fine-tuning #2423

karthik19967829 · 2023-09-14T03:47:50Z

Make llama-2 template as default to support fine-tuning llama-2 models better

Why are these changes needed?

This changes changes the template in train.py to llama-2 . From my experiments I reached better convergence and starting loss with llama-2 template for the llama-2 7B model than the vicuna template . Considering a significant part of the community is using llama-2 it might be good to make it the default template , or have a mechanism to support llama-2

Related issue number (if applicable)

Fixes #2043

Checks

I've run format.sh to lint the changes in this PR.
I've included any doc changes needed.
I've made sure the relevant tests are passing (if applicable).

Make llama-2 template as default to support fine-tuning llama-2 models better

merrymercy · 2023-09-18T01:36:27Z

Can we make this an argument conversation_template in

FastChat/fastchat/train/train.py

Line 53 in c7e3e67

class TrainingArguments(transformers.TrainingArguments):

? Let us keep the vicuna as the default one and gradually move to llama-2 as the default
Could you fix the format?

karthik19967829 · 2023-09-18T06:43:55Z

Sure will make these changes

merrymercy · 2023-11-05T02:07:09Z

closed due to inactivity

Update train.py

7475455

Make llama-2 template as default to support fine-tuning llama-2 models better

merrymercy force-pushed the main branch 2 times, most recently from 14c0818 to e4758da Compare September 19, 2023 00:32

merrymercy force-pushed the main branch from cc83153 to 8e8a604 Compare September 29, 2023 04:57

karthik19967829 mentioned this pull request Sep 30, 2023

Llama-2 loss and learning rate is always 0 after first step #2072

Open

merrymercy force-pushed the main branch from b6bf6b7 to 125f374 Compare October 10, 2023 20:35

lwaekfjlk mentioned this pull request Oct 26, 2023

Added sotopia template sotopia-lab/sotopia-pi#79

Closed

6 tasks

merrymercy closed this Nov 5, 2023

akujuou-sony mentioned this pull request Jan 18, 2024

Ask about the usage of template #2517

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add llama-2 template support for fine-tuning #2423

Add llama-2 template support for fine-tuning #2423

karthik19967829 commented Sep 14, 2023

merrymercy commented Sep 18, 2023

karthik19967829 commented Sep 18, 2023

merrymercy commented Nov 5, 2023

Add llama-2 template support for fine-tuning #2423

Add llama-2 template support for fine-tuning #2423

Conversation

karthik19967829 commented Sep 14, 2023

Why are these changes needed?

Related issue number (if applicable)

Checks

merrymercy commented Sep 18, 2023

karthik19967829 commented Sep 18, 2023

merrymercy commented Nov 5, 2023