Skip to content

Commit 9099020

Browse files
committed
Fix argument max_seq_length (not supported)
1 parent b22dc4d commit 9099020

File tree

11 files changed

+11
-11
lines changed

11 files changed

+11
-11
lines changed

recipes/constitutional-ai/sft/config_anthropic.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -40,7 +40,7 @@ log_level: info
4040
logging_steps: 5
4141
logging_strategy: steps
4242
lr_scheduler_type: cosine
43-
max_seq_length: 2048
43+
max_length: 2048
4444
max_steps: -1
4545
num_train_epochs: 1
4646
output_dir: data/mistral-7b-sft-constitutional-ai

recipes/constitutional-ai/sft/config_grok.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -40,7 +40,7 @@ log_level: info
4040
logging_steps: 5
4141
logging_strategy: steps
4242
lr_scheduler_type: cosine
43-
max_seq_length: 2048
43+
max_length: 2048
4444
max_steps: -1
4545
num_train_epochs: 1
4646
output_dir: data/mistral-7b-sft-constitutional-ai

recipes/smollm/sft/config.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -90,7 +90,7 @@ log_level: info
9090
logging_steps: 5
9191
logging_strategy: steps
9292
lr_scheduler_type: cosine
93-
max_seq_length: 2048
93+
max_length: 2048
9494
max_steps: -1
9595
num_train_epochs: 1
9696
output_dir: data/smollm-360M-instruct-new

recipes/smollm2/sft/config.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -27,7 +27,7 @@ log_level: info
2727
logging_steps: 5
2828
logging_strategy: steps
2929
lr_scheduler_type: cosine
30-
max_seq_length: 8192
30+
max_length: 8192
3131
max_steps: -1
3232
num_train_epochs: 2
3333
output_dir: data/smollm2-1.7B-sft

recipes/smollm2/sft/config_smol.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -26,7 +26,7 @@ log_level: info
2626
logging_steps: 5
2727
logging_strategy: steps
2828
lr_scheduler_type: cosine
29-
max_seq_length: 8192
29+
max_length: 8192
3030
max_steps: -1
3131
num_train_epochs: 2
3232
output_dir: data/smollm2-360M-sft

recipes/starchat2-15b/sft/config_v0.1.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -87,7 +87,7 @@ log_level: info
8787
logging_steps: 5
8888
logging_strategy: steps
8989
lr_scheduler_type: cosine
90-
max_seq_length: 2048
90+
max_length: 2048
9191
max_steps: -1
9292
num_train_epochs: 3
9393
output_dir: data/starchat2-15b-v0.1

recipes/zephyr-7b-beta/sft/config_full.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -39,7 +39,7 @@ log_level: info
3939
logging_steps: 5
4040
logging_strategy: steps
4141
lr_scheduler_type: cosine
42-
max_seq_length: 2048
42+
max_length: 2048
4343
max_steps: -1
4444
num_train_epochs: 1
4545
output_dir: data/zephyr-7b-sft-full

recipes/zephyr-7b-beta/sft/config_qlora.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -54,7 +54,7 @@ log_level: info
5454
logging_steps: 5
5555
logging_strategy: steps
5656
lr_scheduler_type: cosine
57-
max_seq_length: 2048
57+
max_length: 2048
5858
max_steps: -1
5959
num_train_epochs: 1
6060
output_dir: data/zephyr-7b-sft-qlora

recipes/zephyr-7b-gemma/sft/config_full.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -42,7 +42,7 @@ log_level: info
4242
logging_steps: 5
4343
logging_strategy: steps
4444
lr_scheduler_type: cosine
45-
max_seq_length: 2048
45+
max_length: 2048
4646
max_steps: -1
4747
num_train_epochs: 3
4848
output_dir: data/zephyr-7b-gemma-sft

scripts/sft.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -24,7 +24,7 @@
2424
--learning_rate 2.0e-5 \
2525
--num_train_epochs 1 \
2626
--packing \
27-
--max_seq_length 4096 \
27+
--max_length 4096 \
2828
--per_device_train_batch_size 2 \
2929
--gradient_accumulation_steps 8 \
3030
--gradient_checkpointing \

0 commit comments

Comments
 (0)