Text extractor #878

Merged (2 commits) on Nov 10, 2024

Conversation

@Somyajain2004 (Contributor) commented Nov 10, 2024

About the project:

I have built a text extractor using two methods: PyTesseract and EasyOCR. Both are powerful Python libraries used mainly for optical character recognition (OCR).
PyTesseract is the faster of the two, while EasyOCR comes in handy for extracting text even from noisy images.
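
The two approaches described above can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: the function names, the `min_confidence` parameter, and the default language list are assumptions made for the example. It assumes `pytesseract` (with the Tesseract binary installed), `Pillow`, and `easyocr` are available; the third-party imports are deferred into the functions that need them.

```python
from typing import Iterable, Sequence, Tuple

# One EasyOCR detection: (bounding box, recognized text, confidence score).
Detection = Tuple[Sequence, str, float]


def extract_with_tesseract(image_path: str) -> str:
    """Extract text with PyTesseract (requires the Tesseract binary)."""
    # Imported lazily so the module loads even without these packages.
    from PIL import Image
    import pytesseract

    return pytesseract.image_to_string(Image.open(image_path)).strip()


def join_easyocr_results(results: Iterable[Detection],
                         min_confidence: float = 0.0) -> str:
    """Join EasyOCR detections into one string, one detection per line.

    Detections below `min_confidence` (a hypothetical knob for this
    sketch) are dropped, which can help with noisy images.
    """
    return "\n".join(
        text for _bbox, text, conf in results if conf >= min_confidence
    )


def extract_with_easyocr(image_path: str,
                         languages: Sequence[str] = ("en",),
                         min_confidence: float = 0.0) -> str:
    """Extract text with EasyOCR (downloads models on first use)."""
    import easyocr

    reader = easyocr.Reader(list(languages))
    # readtext returns a list of (bbox, text, confidence) tuples by default.
    return join_easyocr_results(reader.readtext(image_path), min_confidence)
```

Calling `extract_with_tesseract("input.png")` or `extract_with_easyocr("input.png")` (the file name is a placeholder) would return the recognized text as a plain string, ready for export.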

Screenshots

1. Text extraction using PyTesseract:
   - Input: Test_img_tessseract
   - Output: tesseract_output

2. Text extraction using EasyOCR:
   - Input: {0FBB66D0-2121-417A-84C9-01C0D97586AD}
     (the input file is shown in a different format here, as GitHub doesn't support .jfif)
   - Output: easyOCR_output

Additionally, I added functionality to convert the extracted text into PDF and TXT files for both types of extraction (i.e., EasyOCR and Tesseract).
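
As a rough sketch of what such an export step might look like: the PR does not say which PDF library it uses, so the use of `fpdf` here is an assumption, as are the helper names `save_as_txt`, `to_latin1`, and `save_as_pdf`. The TXT path uses only the standard library.

```python
from pathlib import Path


def save_as_txt(text: str, out_path: str) -> Path:
    """Write the extracted text to a UTF-8 text file."""
    path = Path(out_path)
    path.write_text(text, encoding="utf-8")
    return path


def to_latin1(text: str) -> str:
    """Replace characters outside Latin-1 with '?'.

    The built-in PDF core fonts only cover Latin-1, so this avoids
    encoding errors at the cost of losing unsupported characters.
    """
    return text.encode("latin-1", errors="replace").decode("latin-1")


def save_as_pdf(text: str, out_path: str) -> Path:
    """Write the extracted text to a simple single-column PDF.

    Assumes the third-party `fpdf` package is installed; it is
    imported lazily so the TXT export works without it.
    """
    from fpdf import FPDF

    pdf = FPDF()
    pdf.add_page()
    pdf.set_font("Helvetica", size=12)
    # Width 0 means "extend to the right margin"; 8 is the line height.
    pdf.multi_cell(0, 8, to_latin1(text))
    pdf.output(str(out_path))
    return Path(out_path)
```

Either OCR method only needs to produce a string; the same two helpers can then serve both the EasyOCR and the Tesseract path.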

vercel bot commented Nov 10, 2024

The latest updates on your projects. Learn more about Vercel for Git ↗︎

| Name | Status | Preview | Comments | Updated (UTC) |
| --- | --- | --- | --- | --- |
| ml-nexus | ✅ Ready (Inspect) | Visit Preview | 💬 Add feedback | Nov 10, 2024 0:18am |

👋 Thank you for opening this pull request! We appreciate your contribution to improving this project. Your PR is under review, and we'll get back to you shortly.
Don't forget to mention the issue you solved!

@Somyajain2004 (Contributor, Author)

Please review ASAP, as GGSOC ends today at 7 PM.

@UppuluriKalyani UppuluriKalyani merged commit bb0b356 into UppuluriKalyani:main Nov 10, 2024
4 checks passed

🎉🎉 Thank you for your contribution! Your PR #878 has been merged! 🎉🎉

Somyajain2004 added a commit to Somyajain2004/Open-source-Practice that referenced this pull request Nov 10, 2024
PR link : UppuluriKalyani/ML-Nexus#878

Hi GGSOC team,
The PR mentioned above has been reviewed and approved by the reviewer, but my GGSOC leaderboard score hasn't been updated for this PR.
Please look into it, as GGSOC ends at 7 PM today.

@sanjay-kv @MastanSayyad
2 participants