Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

OCR language/page segmentation, Google Cloud Vision and FUNSD format #180

Open
naourass opened this issue Jun 28, 2022 · 2 comments
Open

Comments

@naourass
Copy link

naourass commented Jun 28, 2022

Hello and many thanks for this awesome tool.

For the needs of my current use case, I added these features to my local pawls clone :

  • OCR argument for choosing tesseract language and segmentation type
  • Google Cloud Vision OCR support which may be more accurate than tesseract in some cases (should download credential file from GCP to root folder)
  • Command to generate FUNSD-like json with labels, text and words from coco and tokens csv.

If these features or any of them is relevant to pawls vision, let me know so I can make a PR.

@lolipopshock
Copy link
Collaborator

Sounds great -- looking forward to your PR!

@naourass
Copy link
Author

naourass commented Jul 9, 2022

Hi there, I've just made a first PR. The second one will be following soon with FUNSD-like and IOB export. Let me know if you have any remarks !

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants