Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

國語辭典詞條 #39

Open
enigmasc opened this issue Nov 17, 2020 · 0 comments
Open

國語辭典詞條 #39

enigmasc opened this issue Nov 17, 2020 · 0 comments

Comments

@enigmasc
Copy link

被convertzz搞得火冒三丈之餘,靜下來看看到底哪裡出錯,發現內建的dictionary.cvs內容雜亂,毫無系統可言,不是用有組織章法的方式建立的詞庫,難怪轉換結果亂現叢生。

繁簡字體的轉換錯誤,大多出在繁體,因為簡體到繁體是一對多;繁體到簡體,由於是多對一,出錯的機率小很多,所以需要大型繁體詞庫來糾錯。建議用教育部的「重編國語辭典」為基礎,建立有系統的糾錯詞庫,免去亂槍打鳥的問題。

最後還是要說:能把繁簡字轉換做好,已經是浩大的工程,別搞複雜百倍的兩岸詞彙轉換了!
國語辭典詞條.zip

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant