Training process and loss understanding #81

Open
SanketDhuri opened this issue Oct 23, 2023 · 2 comments

@SanketDhuri

  1. We have a dataset of 5000-6000 audio clips, each at least 2 seconds long, but even after training on it for 15 epochs the model does not reach a satisfactory loss. What is the issue here?

For example, the last observed losses are:
Semantic: train loss = 0.00309, val loss = 1.28167
Coarse: train loss = 0.057, val loss = 3.2796
Fine: train loss = 0.1, val loss = 1.18

  2. Why does it sometimes add or skip words on its own, and why does the generated voice sometimes have a shivering tone?
  3. Why does it have problems with punctuation such as ! or . ?
  4. We are training with a mixed dataset of different speakers, but they all speak in a similar way. Can this cause problems during training?
  5. After fine-tuning, pretrained prompts such as [laughs] are lost. What might be the reason?
     We are preparing the datasets with the help of the Whisper model, so should we also add those prompts to the dataset manually? (See the sketch after this list.)
  6. How can we add new emotion prompts such as sad, excited, or unhappy?
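
Regarding the dataset preparation in item 5, here is a minimal sketch, assuming the openai-whisper package and hypothetical file paths. Whisper produces plain transcripts without expressive markers, so tags such as [laughs] would have to be inserted manually (or by a separate labelling step) if they should appear in the training text:

    # Minimal sketch (hypothetical paths and helper): transcribe training audio
    # with openai-whisper. Whisper emits plain text only, so tags like [laughs]
    # are not produced automatically and must be added by hand where they apply.
    import whisper

    model = whisper.load_model("base")  # model size is an assumption

    def make_transcript(audio_path: str, manual_tags: str = "") -> str:
        """Return Whisper's transcript, optionally with hand-added tags appended."""
        result = model.transcribe(audio_path)
        text = result["text"].strip()
        # e.g. manual_tags = "[laughs]" where the audio actually contains a laugh
        return f"{text} {manual_tags}".strip()

    print(make_transcript("data/audio/sample_0001.wav", manual_tags="[laughs]"))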
@dagshub

dagshub bot commented Oct 23, 2023

@boringtaskai

Hi, how do you use the weights after training? Because I had an issue like this:

raise ValueError(f"missing keys: {missing_keys}")
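
For reference, a minimal PyTorch sketch of how one might inspect such a checkpoint is shown below; the file path and the nesting of the weights under a "model" key are assumptions, and the right fix depends on how the training script saved the weights:

    import torch

    # Hypothetical checkpoint path; the real name depends on the training script.
    ckpt = torch.load("out/finetuned.pt", map_location="cpu")
    state_dict = ckpt.get("model", ckpt)  # some trainers nest weights under "model"

    # Compare the stored keys with what the inference code expects to load.
    print(sorted(state_dict.keys())[:5])

    # A frequent cause of "missing keys" is a prefix added by a training wrapper,
    # e.g. DataParallel's "module." or torch.compile's "_orig_mod.".
    cleaned = {k.removeprefix("module.").removeprefix("_orig_mod."): v
               for k, v in state_dict.items()}

    # If the architectures really match, load non-strictly and inspect the report:
    # missing, unexpected = model.load_state_dict(cleaned, strict=False)
    # print("missing:", missing, "unexpected:", unexpected)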
