Replies: 5 comments 1 reply
-
Can you please be more specific? Which tool to use? How to decide if at all to OCR? ... |
Beta Was this translation helpful? Give feedback.
-
Just convert image pdf to text bi-layer pdf with good layout |
Beta Was this translation helpful? Give feedback.
-
Try OCRmyPDF. If you want an integration within your PyMuPDF script use the integrated Tesseract access. There are example scripts in the utilities repo. |
Beta Was this translation helpful? Give feedback.
-
Tesseract is hard to train and is low in accuracy. I want to change to paddleocr, any method? |
Beta Was this translation helpful? Give feedback.
-
What is the suggested approach for ocr pdf?
Beta Was this translation helpful? Give feedback.
All reactions