Is it possible to have a thread where people can submit examples or testcases where the text normalizer is not doing perfectly? #494
huangruizhe started this conversation in Ideas
Replies: 1 comment
-
Yes, the normalizer definitely has many rough edges. Your first example seems easy to fix; the second is a bit trickier because it's hard to differentiate between possessive |
-
Thanks for the great work!
I was just wondering whether we could submit or share ad-hoc test cases to make the text normalizer more robust.
For example, some wrong normalizations are:
2020 Third Quarter => 2023rd quarter
Monroe's financial release => monroe is financial release
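
In case it helps, here is a rough sketch of how such shared cases could be collected and checked. It is only an illustration: `KnownBadCase`, `KNOWN_BAD_CASES`, and `find_regressions` are hypothetical names I made up, not part of the project, and the normalizer is passed in as a plain callable because I don't want to assume its actual API. Each case records only the input and the wrong output seen above, so the check flags a case only while the normalizer still reproduces that exact wrong output.

```python
from typing import Callable, List, NamedTuple


class KnownBadCase(NamedTuple):
    """One reported failure (hypothetical structure, not from the project)."""
    text: str        # input handed to the normalizer
    bad_output: str  # wrong normalization reported in this thread


# The two cases reported above.
KNOWN_BAD_CASES: List[KnownBadCase] = [
    KnownBadCase("2020 Third Quarter", "2023rd quarter"),
    KnownBadCase("Monroe's financial release", "monroe is financial release"),
]


def find_regressions(
    normalize: Callable[[str], str],
    cases: List[KnownBadCase] = KNOWN_BAD_CASES,
) -> List[KnownBadCase]:
    """Return the cases for which `normalize` still yields the reported bad output."""
    return [case for case in cases if normalize(case.text) == case.bad_output]


if __name__ == "__main__":
    # Stand-in normalizer for demonstration only (plain lowercasing);
    # swap in the real normalizer callable here.
    for case in find_regressions(str.lower):
        print(f"still wrong: {case.text!r} => {case.bad_output!r}")
```

New reports could then be added as one line each, and the list run against every normalizer change.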