Hi,
Thank you very much for your great work!
I was wondering whether you have conducted any evaluations of model performance and voice quality for the multi-speaker results, e.g. MOS or sMOS?
After listening to the demos you provided, I found that the generated voices for speakers p257 and p250 sound quite similar. (I assume p250-265.wav and p257-243.wav come from different speakers.)
p250
demo/VCTK)/shallow_diffusion_400k/demo_VCTK_shallow_diffusion_400k_p250-265.wav
p257
demo/VCTK)/shallow_diffusion_400k/demo_VCTK_shallow_diffusion_400k_p257-243.wav
Could you please give me a hint?
Thanks!