the project can not beat pypinyin! #9

shawnthu · 2020-10-22T10:16:07Z

No description provided.

seanie12 · 2021-01-12T10:37:18Z

please describe your experimental setup.

In our experimental setup, our model performs better than the other baselines.

JohnHerry · 2021-08-26T02:03:27Z

The pretrained g2pM model is even worse then pypinyin. The count of poly char is too much while the training corpus is too small. But even we had extend the corpus, the result is not so good.

dyustc · 2023-04-06T08:00:36Z

`

p1 = lazy_pinyin(sentence, style=Style.TONE3, neutral_tone_with_five=True)
print('pypinyin lazy')
print(p1)

model = G2pM()
p2 = model(sentence, tone=True, char_split=False)
print('g2m')
print(p2)`

Here is what I found where it may perform worth than pypinyin...
`
然而，他红了20年以后，他在长沙长大，也在长沙退休。

pypinyin lazy

['ran2', 'er2', '，', 'ta1', 'hong2', 'le5', '20', 'nian2', 'yi3', 'hou4', '，', 'ta1', 'zai4', 'chang2', 'sha1', 'zhang3', 'da4', '，', 'ye3', 'zai4', 'chang2', 'sha1', 'tui4', 'xiu1', '。']

g2m

['ran2', 'er2', '，', 'ta1', 'hong2', 'le5', '20', 'nian2', 'yi3', 'hou4', '，', 'ta1', 'zai4', 'chang2', 'sha1', 'chang2', 'da4', '，', 'ye3', 'zai4', 'chang2', 'sha1', 'tui4', 'xiu1', '。']`

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

the project can not beat pypinyin! #9

the project can not beat pypinyin! #9

shawnthu commented Oct 22, 2020

seanie12 commented Jan 12, 2021

JohnHerry commented Aug 26, 2021

dyustc commented Apr 6, 2023

the project can not beat pypinyin! #9

the project can not beat pypinyin! #9

Comments

shawnthu commented Oct 22, 2020

seanie12 commented Jan 12, 2021

JohnHerry commented Aug 26, 2021

dyustc commented Apr 6, 2023