Generated outputs sound robotic in some cases! #14

aniketp02 · 2022-06-09T14:17:27Z

Hello!

I have trained Grad-TTS on quite a few datasets and observed that it produces excellent results for most voices, but the generated results for heavy voices sound very robotic! Can anyone explain possible reasons for this?

I have attached samples of the original and generated voices.

Thank you!

Original male voice

org_sagar.mov

Generated male voice

gen_sagar.mov

Original female voice

org_anupama.mov

Generated female voice

gen_anupama.mov

ivanvovk · 2022-06-09T15:27:13Z

@aniketp02 Hey! Which HiFi-GAN version do you use? The checkpoint we provide in this repo is trained on LJSpeech only.

aniketp02 · 2022-06-10T06:08:43Z

@ivanvovk Thanks for pointing that out; I was indeed using the checkpoint provided in this repo. Using the Universal HiFi-GAN checkpoint certainly improves the results!!

sagar_gen.mov

ivanvovk · 2022-06-10T19:43:35Z

@aniketp02 Glad to hear it helped! For even better quality, you can fine-tune HiFi-GAN on the outputs of Grad-TTS.
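Editor's note: a minimal sketch of one preparation step for that fine-tuning, not something posted in the thread. It assumes the convention described in the HiFi-GAN README: each generated mel-spectrogram is saved as a `.npy` file whose basename matches its ground-truth `.wav` file, and training is then started with `--fine_tuning True`. The helper name `find_unpaired` and the directory names `wavs` and `ft_dataset` are hypothetical; the function only checks that every recording has a matching mel file before training starts.

```python
from pathlib import Path


def find_unpaired(wav_dir, mel_dir):
    """Return sorted basenames of .wav files with no matching .npy mel file.

    Assumption: mels are stored as <basename>.npy next to nothing else
    that matters, and audio is stored as <basename>.wav.
    """
    wav_names = {p.stem for p in Path(wav_dir).glob("*.wav")}
    mel_names = {p.stem for p in Path(mel_dir).glob("*.npy")}
    return sorted(wav_names - mel_names)


if __name__ == "__main__":
    # Hypothetical directory layout; adjust to your own dataset.
    missing = find_unpaired("wavs", "ft_dataset")
    if missing:
        print("No generated mel found for:", ", ".join(missing))
    else:
        print("Every wav file has a matching mel file.")
```

If the list is non-empty, generate the missing mels with Grad-TTS (or drop those recordings from the training file list) before starting the vocoder fine-tuning.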
