Only noise in checkpoint audio. #354

marshoepial · 2020-10-26T17:53:43Z

I'm trying to use my own data for training. I've tested with the LJSpeech dataset, which even after a few thousand steps produces speech-like audio. Yet, training on my dataset (16000 Hz), it comes out as plain noise after even 40,000 steps. I'm assuming this is because of the audio hparams settings, where I changed the sample rate from 20000 to 16000, but I'm not sure what to change them to. For a 20000 hz audio, the length of frames are much shorter than the default setting, and I'm not sure what the frame shift is used for either. Is this something you tune by hand or is there a way to calculate these values? Thanks.

berkaycinci · 2020-12-28T09:47:54Z

I am facing the same problem. Did you find any solution? @DashEightMate

saharsyed · 2021-01-26T11:42:57Z

audio length =max_iters * outputs_per_step * frame_shift_ms

saharsyed · 2021-01-26T11:43:14Z

40,000 seems to be quite less i assume

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Only noise in checkpoint audio. #354

Only noise in checkpoint audio. #354

marshoepial commented Oct 26, 2020

berkaycinci commented Dec 28, 2020

saharsyed commented Jan 26, 2021

saharsyed commented Jan 26, 2021

Only noise in checkpoint audio. #354

Only noise in checkpoint audio. #354

Comments

marshoepial commented Oct 26, 2020

berkaycinci commented Dec 28, 2020

saharsyed commented Jan 26, 2021

saharsyed commented Jan 26, 2021