Implementation performance #16

LeyuanQu · 2023-01-12T06:12:43Z

Hi,
thank you very much for your great work!

I was wondering if you conduct any evaluations on the model performance and voice quality for multi-speaker results, e.g. MOS or sMOS ?

After listening the demos you provided, I found the generated voices for speaker p257 and p250 are quite similar. (I suppose p250-265.wav and p257-243.wav come from different speakers.)
p250
demo/VCTK)/shallow_diffusion_400k/demo_VCTK_shallow_diffusion_400k_p250-265.wav

p257
demo/VCTK)/shallow_diffusion_400k/demo_VCTK_shallow_diffusion_400k_p257-243.wav

Could you please give me a hint?

Thanks!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implementation performance #16

Implementation performance #16

LeyuanQu commented Jan 12, 2023

Implementation performance #16

Implementation performance #16

Comments

LeyuanQu commented Jan 12, 2023