Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

说话人会有概率加入到语音生成出来[BUG] #233

Open
only-ns opened this issue May 20, 2024 · 3 comments
Open

说话人会有概率加入到语音生成出来[BUG] #233

only-ns opened this issue May 20, 2024 · 3 comments
Labels
bug Something isn't working

Comments

@only-ns
Copy link

only-ns commented May 20, 2024

image

https://huggingface.co/spaces/fishaudio/fish-speech-1 (webui)
这是推理出来的音频,复现概率很高(这个说话人是为了提高复现概率)
wav

@only-ns only-ns added the bug Something isn't working label May 20, 2024
@Stardust-minus
Copy link
Member

这是因为说话人是以token形势嵌入推理序列的。临时的解决办法是不指定说话人,只使用ref

@leng-yue
Copy link
Member

说话人 OOD 了...

@Stardust-minus
Copy link
Member

说话人 OOD 了...

不认可,集内说话人也有概率

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

3 participants