You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Prices are prefixed with the ¤ symbol. This symbol is not in the english training set of Tesseract and is read as a random character. When this character is read as a digit, it inflates the prices read by an order of magnitude, i.e. ¤900 becomes 2900.
Tasks
Experiment with heuristics to mitigate the issue
Thousands are always separated by a comma , and groups of digit are only up to 3 long
Only 1 digit is present before the comma , when the price is listed in kilo units K
Others
Results
The ¤ character doesn't inflate prices
The text was updated successfully, but these errors were encountered:
Situation
Prices are prefixed with the
¤
symbol. This symbol is not in the english training set of Tesseract and is read as a random character. When this character is read as a digit, it inflates the prices read by an order of magnitude, i.e.¤900
becomes2900
.Tasks
,
and groups of digit are only up to 3 long,
when the price is listed in kilo unitsK
Results
¤
character doesn't inflate pricesThe text was updated successfully, but these errors were encountered: