-
Notifications
You must be signed in to change notification settings - Fork 207
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
question about training loss and inference performance #61
Comments
You probably need to fine-tune your bottleneck dimensions. |
Do you think I should enlarge the bottleneck dimension or decrease the bottleneck dimension? |
There's detailed information in the paper on how to tune the bottleneck. |
OK, thank you~ |
the paper said: But for new dataset, how to choose the hparams? And wheather we should use DANN idea? |
@zzw922cn Call you tell me which dataset you used and the batch size of training process ? Thanks in advance !! |
Hi, thank you for your very nice work! I have rerun this project, and it has run 90K steps. the loss_id_psnt is around 0.07. And I tried to feed into a in-domain speaker's melspec and his speaker embedding as source embedding, and another speaker's speaker embedding as target speaker embedding. Then I use GL vocoder to generate the wav, I found the voice is still of the source speaker. Is this normal? When can I perform voice conversion successfully? at what step or what's the loss_id_psnt? thank you very much!!
The text was updated successfully, but these errors were encountered: