-
Notifications
You must be signed in to change notification settings - Fork 9.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Incorrect bounding boxes #2264
Comments
Its already copy/pasted above the image :) |
We do not support 3rd party sw/project: please provide test case with tesseract executable or simple c++/c test code. |
I just use the API.. Have used an eample from the wiki
|
And something more.. In the footer |
Check your file https://imgur.com/a/12qqobk - seems like problem with your prepocesing and not tesseract ;-) |
ohh.. my bad :) |
Based on my test it is caused by |
Ok, but how would that affect my results when the language is danish? |
This is really strange, but I play with my test case and now I am not able to reproduce my results from #2264 (comment)... |
strange.. but I experience pretty often that the bounding box is incorrect |
What will Will it use the legacy engine + LSTM engine? If so.. then have to download the legacy traindata too? |
Just a quick reply: I use tessdata_best repository and I did not get any error. |
ok thanks.. but why have you linked this issue with "Tesseract 4.0 hangs when processing a particular image #2288" ? This issue is only about incorrect bounding boxes I have another example where the bounding box is incorrect And I could go on :) There are really a problem with the bounding boxes. In 50% of all results one or more bounding box is not off by a little - but alot off.. The example below has at least one bounding box off by
The last column
|
Another question.. Isn't here an option where you can tell the OCR engine ONLY to recognize horizontal text and not to try to autodetect the orientation and not trying to recognize vertical text? |
This is not support page! Please respect guidelines for posting issue: use tesseract user forum for asking questions/support. |
I'm so sorry about that But what about the bug? |
That was pretty arrogant... You also just closed this issue which obviosly was a bug (2-3 of the most contributing developers on this project even confirmed it was a bug) |
No it was not arrogant: You can ask for support on user forum. Maybe somebody will be willing to help you for free. There are also several (paid) developers who did what you try to do exactly. But they will not share their knowledge for free. |
But why doesn't it output the correct bounding box with LSTM engine only? There must be some inconsistency in the code.. The second example shows the same problem with incorrect bounding boxes and I use OEM_DEFAULT as you suggested.. |
And about the API code that returns I'm not asking for free support directly.. Just if you had a quick work-around (a command line parameter) And about the API.. Why doesn't it come clear of the API docs how to initiate tesseract correctly..? The error occurs while the API is initiated.. not in my code If it really isn't a bug then there should be an example how to initate the API correctly to handle internal tesseract errors.. |
I can see there is a difference between the outputs (original vs preprocessed) but there is no distortion between the two boxes that connect the two lines.. but they intersect in the output Of course the images are not completely equal.. There is a small variation in the pixels.. but not a variation that should change the bounding boxes in the manner it does.. Telling the users that only one specific image would work (or use a different parameter which in teori do the same work) and not others is just a poor way to ignore there is a problem.. If the arrangement of the pixels make the base of something you can predict and the program doesn't output what you predict = a bug The better tesseract works and the more robust it is is best for everybody.. I think its a great program and thats why I want to contribute with my experiences to make it even better |
I use the latest release
4.1
with LSTM only and with best traindata fileshttps://github.com/tesseract-ocr/tesseract/archive/4.1.0-rc1.tar.gz
Just to give an example
The two amounts to the right
528,00
and72,00
overlap each other in the OCR results but does not overlap in the input imageHere is a link the the preprocessed image (tiff) before sending it to tesseract
https://imgur.com/a/12qqobk
They intersect with
10 px
(1353 - 1343) even though they are far from each otherBounding box for
528,00
:Bounding box for
72,00
:The text was updated successfully, but these errors were encountered: