
Can anyone share a 44k pretrained model, or give some guidance for training a 44k model from scratch on a tiny dataset? #704

Closed
ILG2021 opened this issue Jan 9, 2025 · 13 comments
Labels: question (Further information is requested)

ILG2021 commented Jan 9, 2025

Checks

  • This template is only for questions, not feature requests or bug reports.
  • I have thoroughly reviewed the project documentation and read the related paper(s).
  • I have searched existing issues, including closed ones, and found no similar questions.
  • I confirm that I am using English to submit this report in order to facilitate communication.

Question details

I want to train a 44k model to get better voice quality, but training failed. My dataset is about 10 hours. After about 300k updates, the learning rate had decreased to 1e-13 and the model seemed to stop updating; continuing to 400k updates still showed no improvement. The voice is clear, but the content is still a mess. I think the model cannot learn alignment with a tiny dataset. Does anyone have a successful example?

ILG2021 added the question label on Jan 9, 2025
SWivid (Owner) commented Jan 9, 2025

> the learning rate has decreased to 1e-13

Set a large epoch number and train longer.
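For context, here is a minimal sketch of why the LR collapses, assuming the trainer uses a linear warmup followed by a linear decay whose length is derived from the total planned updates (epochs × updates per epoch); the actual F5-TTS scheduler may differ in detail:

```python
# Hypothetical warmup + linear-decay schedule (an assumption for illustration;
# check the trainer's actual scheduler). total_updates comes from the epoch
# count, so a small epoch number makes the decay finish early and the LR
# collapse toward zero long before 300k updates.
def lr_at(update, total_updates, warmup=300, peak_lr=7.5e-5):
    if update < warmup:
        return peak_lr * update / warmup                         # linear warmup
    remaining = max(total_updates - warmup, 1)
    return peak_lr * max(total_updates - update, 0) / remaining  # linear decay

print(lr_at(300_000, total_updates=50_000))     # 0.0 -- schedule already exhausted
print(lr_at(300_000, total_updates=5_000_000))  # ~7.05e-05 -- still training
```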

ILG2021 (Author) commented Jan 9, 2025

> the learning rate has decreased to 1e-13
>
> Set a large epoch number and train longer.

Thanks. Should I use the small model instead of the base model? @SWivid

SWivid (Owner) commented Jan 9, 2025

> My dataset is about 10 hours

A smaller model size is fine.

ILG2021 closed this as completed on Jan 9, 2025
ILG2021 reopened this on Jan 10, 2025
ILG2021 (Author) commented Jan 10, 2025

I have set epochs to 100000 and used the F5 small model architecture.
After 710k updates, the content is still a mess and the sound has become noisy. Maybe F5 is not suitable for training from scratch on a tiny dataset.

SWivid (Owner) commented Jan 11, 2025

> I have set epochs to 100000 and used the F5 small model architecture.
> After 710k updates

Have you reset the epochs and restarted the training, or continued from the previous run? How does the learning rate curve look?

ILG2021 (Author) commented Jan 11, 2025

I created a new project rather than resuming training. The learning rate curve looks fine, because I set epochs to 100000.

SWivid (Owner) commented Jan 11, 2025

What is your batch size, e.g. batch_size_per_gpu, and how many GPUs?

For reference: we use the default settings in the yaml file for the small model to train on 24 hours of LJSpeech.

ILG2021 (Author) commented Jan 12, 2025

```json
{
  "exp_name": "F5TTS_Small",
  "learning_rate": 7.5e-05,
  "batch_size_per_gpu": 4800,
  "batch_size_type": "frame",
  "max_samples": 64,
  "grad_accumulation_steps": 1,
  "max_grad_norm": 1,
  "epochs": 100000,
  "num_warmup_updates": 300,
  "save_per_updates": 10000,
  "last_per_steps": 10000,
  "finetune": false,
  "file_checkpoint_train": "",
  "tokenizer_type": "char",
  "tokenizer_file": "",
  "mixed_precision": "fp16",
  "logger": "tensorboard",
  "bnb_optimizer": false
}
```

These are my settings; I have only one 4080 GPU.
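As a rough sanity check (not from the repo), the frame-based batch size implies roughly this many updates per epoch for a 10-hour dataset, assuming on the order of 100 mel frames per second (the exact value depends on the sample rate and hop length of the 44k config):

```python
# Back-of-the-envelope updates per epoch; frames_per_second is an assumption.
dataset_hours = 10
frames_per_second = 100            # hop-size dependent; adjust for the 44k config
batch_frames = 4800                # batch_size_per_gpu above, single GPU

total_frames = dataset_hours * 3600 * frames_per_second    # ~3.6M frames
updates_per_epoch = total_frames / batch_frames
print(f"~{updates_per_epoch:.0f} updates per epoch")       # ~750
```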

SWivid (Owner) commented Jan 12, 2025

> {
> "exp_name": "F5TTS_Small",
> "batch_size_per_gpu": 4800,
> "batch_size_type": "frame",
> "grad_accumulation_steps": 1,
> }
>
> These are my settings; I have only one 4080 GPU.

Your 710k updates at this batch size equal only about 10k updates under the default setting in the yaml file for the small model trained on 24 hours of LJSpeech.

For reference: under the batch_size_per_gpu: 38400 setting (8 GPUs, 8 × 38400 = 307200 frames), it takes 100k updates to get OK results and 200k updates for really good ones.

It certainly takes some time to train from scratch.

ILG2021 (Author) commented Jan 12, 2025

> Your 710k updates at this batch size equal only about 10k updates under the default setting in the yaml file for the small model trained on 24 hours of LJSpeech.
>
> For reference: under the batch_size_per_gpu: 38400 setting (8 GPUs, 8 × 38400 = 307200 frames), it takes 100k updates to get OK results and 200k updates for really good ones.
>
> It certainly takes some time to train from scratch.

You mean I need 100k × 307200 / 4800 = 6400k updates?
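That is the arithmetic behind the question: matching the total number of frames seen in the reference run. A sketch using only the numbers quoted above:

```python
# Scale the reference update count by the ratio of effective batch sizes
# (both in frames), so the model sees the same total number of frames.
reference_updates = 100_000        # "ok results" in the reference run
reference_batch   = 8 * 38_400     # 307200 frames across 8 GPUs
local_batch       = 4_800          # single 4080, config above

equivalent_updates = reference_updates * reference_batch // local_batch
print(f"{equivalent_updates:,} updates")   # 6,400,000 (i.e. 6400k)
```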

ILG2021 (Author) commented Jan 14, 2025

@SWivid

ILG2021 (Author) commented Jan 15, 2025

> What is your batch size, e.g. batch_size_per_gpu, and how many GPUs?
>
> For reference: we use the default settings in the yaml file for the small model to train on 24 hours of LJSpeech.

Hello, how many steps did you train for, and how many hours did it take?

SWivid (Owner) commented Jan 15, 2025

Check the previous response.
Each 100k updates takes 8 hours on 8× H100.
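At the quoted rate, the reference runs work out as follows (single-GPU throughput on a 4080 will of course be far lower; this only converts the numbers above):

```python
# Wall-clock time implied by "each 100k updates takes 8 hours on 8x H100".
hours_per_100k = 8
for label, updates in [("ok results", 100_000), ("really good", 200_000)]:
    print(label, updates // 100_000 * hours_per_100k, "hours")  # 8 h and 16 h
```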
