Hello, could you share your language model? #13

Open
PPPPPYJ opened this issue Dec 26, 2018 · 3 comments

Comments


PPPPPYJ commented Dec 26, 2018

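Hello, and thank you very much for your work. I used the language model that ships with thchs30 as-is, but during testing I keep hitting the error below. My guess is that the language model or the dictionary does not match. Could you share the two files you tested successfully with? Alternatively, could you run a test with the language model and dictionary bundled with thchs30 and tell me exactly where the files are located or what changes are needed? Thank you, and thanks again for all your hard work on this project!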

Traceback (most recent call last):
File "utils/prep_ctc_trans.py", line 75, in
out_line += dict[unk_word] + ' '
KeyError: ''
Traceback (most recent call last):
File "utils/prep_ctc_trans.py", line 75, in
out_line += dict[unk_word] + ' '
KeyError: ''

@Sundy1219
Owner

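I suggest you prepare your own modeling-unit file, units.txt, and a matching dictionary, lexicon.txt:
lexicon.txt
units.txt
The language model is trained separately, but every word it is trained on must appear in this dictionary. The modeling units, lexicon.txt, and lm.gz are then used together to generate the corresponding TLG.fst.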
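The consistency requirement described above can be checked before building TLG.fst. The following is a minimal sketch, not part of this repository: it assumes units.txt has one `<unit> <integer id>` pair per line and lexicon.txt has one `<word> <unit> <unit> ...` entry per line, and the function names and sample entries are hypothetical.

```python
def load_units(lines):
    """Parse units.txt-style lines (assumed format: '<unit> <integer id>')."""
    units = {}
    for line in lines:
        parts = line.split()
        if len(parts) >= 2:
            units[parts[0]] = int(parts[1])
    return units


def load_lexicon(lines):
    """Parse lexicon.txt-style lines (assumed format: '<word> <unit> <unit> ...')."""
    lexicon = {}
    for line in lines:
        parts = line.split()
        if parts:
            lexicon[parts[0]] = parts[1:]
    return lexicon


def find_mismatches(units, lexicon, transcript_words):
    """Return (units used in the lexicon but absent from units,
    transcript words absent from the lexicon), both sorted."""
    missing_units = sorted(
        {u for pron in lexicon.values() for u in pron if u not in units}
    )
    oov_words = sorted({w for w in transcript_words if w not in lexicon})
    return missing_units, oov_words


if __name__ == "__main__":
    # Hypothetical sample data; in practice, pass open file handles instead.
    units = load_units(["a1 1", "ni3 2", "hao3 3"])
    lexicon = load_lexicon(["你好 ni3 hao3", "啊 a1", "好的 hao3 de5"])
    missing_units, oov_words = find_mismatches(units, lexicon, ["你好", "世界"])
    print("units missing from units.txt:", missing_units)
    print("words missing from lexicon.txt:", oov_words)
```

If either list is non-empty, a dictionary lookup during transcript preparation may fail with a KeyError, so fixing the files first is likely cheaper than debugging the script.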

@Sundy1219
Owner

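If you have any questions, you can ask @fantasyoooo; I have already answered for him every problem you have run into. Those answers can be found in the issues.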

@liubin3702

Has the problem you had with this project been solved?
