[Bug] Numbers are not synthesized correctly when the text mixes digits and Chinese #412

Open
guozanhua218 opened this issue Jun 23, 2024 · 3 comments
Labels: bug (Something isn't working), documentation (Improvements or additions to documentation)

Comments

@guozanhua218

As described in the title.

@fumiama added the bug and documentation labels on Jun 24, 2024
@fumiama
Member

fumiama commented Jun 24, 2024

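Please try installing the WeTextProcessing package correctly; it exists precisely to solve this problem. Of course, the better fix would still be for the model itself to support digits.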

@jianchang512

jianchang512 commented Jun 24, 2024

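For Chinese number handling, consider using the text normalization code from Baidu's PaddlePaddle.

The currently integrated wetext can only be installed under conda, so its compatibility is poor.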

https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/paddlespeech/t2s/frontend/zh_normalization
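To make the problem concrete, here is a minimal sketch of what number normalization does for Chinese text before it reaches a TTS model: digit runs are rewritten as the words a speaker would read aloud. This is not the PaddleSpeech or WeTextProcessing implementation. The names `int_to_zh` and `normalize_numbers` are hypothetical, and the sketch only covers plain non-negative integers (no dates, decimals, units, or the 二/两 distinction), all of which a real normalizer has to handle.

```python
import re

DIGITS = "零一二三四五六七八九"
UNITS = ["", "十", "百", "千"]  # units for positions 0..3 inside a 4-digit group


def _group_to_zh(n: int) -> str:
    """Read an integer in 0..9999 aloud, e.g. 305 -> 三百零五 (0 -> empty string)."""
    out = []
    pending_zero = False
    for pos in range(3, -1, -1):
        d = (n // 10 ** pos) % 10
        if d == 0:
            # Remember an interior zero; it is only spoken if a nonzero digit follows.
            pending_zero = bool(out)
            continue
        if pending_zero:
            out.append("零")
            pending_zero = False
        out.append(DIGITS[d] + UNITS[pos])
    return "".join(out)


def int_to_zh(n: int) -> str:
    """Convert 0 <= n < 10**8 to its Chinese reading (illustrative only)."""
    if not 0 <= n < 10 ** 8:
        raise ValueError("this sketch only supports 0 <= n < 10**8")
    if n == 0:
        return "零"
    high, low = divmod(n, 10000)
    text = ""
    if high:
        text = _group_to_zh(high) + "万"
        if 0 < low < 1000:
            text += "零"  # e.g. 10005 -> 一万零五
    text += _group_to_zh(low)
    if text.startswith("一十"):
        text = text[1:]  # 10 is read 十, not 一十
    return text


def normalize_numbers(text: str) -> str:
    """Replace every digit run in `text` with its Chinese reading."""
    def _replace(match: re.Match) -> str:
        digits = match.group()
        if len(digits) > 8:
            # Assumption: very long runs (IDs, phone numbers) are read digit by digit.
            return "".join(DIGITS[int(c)] for c in digits)
        return int_to_zh(int(digits))

    return re.sub(r"\d+", _replace, text)


print(normalize_numbers("我有3个苹果和25个橘子"))  # 我有三个苹果和二十五个橘子
```

Even this small case shows why a dedicated package is suggested: the rules for zeros, 十, and 万 are easy to get wrong, and real text adds many more categories.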

@fumiama
Member

fumiama commented Jun 24, 2024

I'm considering turning normalization into an interface, so that the library no longer bundles any particular implementation.
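One way such an interface could look is sketched below, purely as an illustration of the idea and not as the project's actual API. The library would keep only a registry of per-language callables, and users would plug in whatever normalizer they prefer, for example a wrapper around WeTextProcessing or the PaddleSpeech zh_normalization code. `NormalizerRegistry`, `register`, and `normalize` are hypothetical names, and the lambda is a stand-in for a real normalizer.

```python
from typing import Callable, Dict

# A normalizer is any function that maps raw text to speakable text.
Normalizer = Callable[[str], str]


class NormalizerRegistry:
    """Hypothetical registry: the library ships no normalizer of its own."""

    def __init__(self) -> None:
        self._by_lang: Dict[str, Normalizer] = {}

    def register(self, lang: str, fn: Normalizer) -> None:
        """Attach a user-supplied normalizer for one language code."""
        if not callable(fn):
            raise TypeError("normalizer must be callable")
        self._by_lang[lang] = fn

    def normalize(self, text: str, lang: str) -> str:
        """Apply the registered normalizer, or pass text through unchanged."""
        fn = self._by_lang.get(lang)
        return fn(text) if fn is not None else text


registry = NormalizerRegistry()
# Stand-in normalizer for the demo; a real one would wrap a full TN package.
registry.register("zh", lambda s: s.replace("3", "三"))

print(registry.normalize("我有3个苹果", "zh"))  # 我有三个苹果
print(registry.normalize("3 apples", "en"))    # 3 apples (no normalizer registered)
```

With this shape, installation problems in any one normalization package stop affecting the core library, since the dependency moves to the user's side.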

Projects
Status: Further Goals