Text extractor #878
Merged
UppuluriKalyani
merged 2 commits into
UppuluriKalyani:main
from
Somyajain2004:TextExtractor
Nov 10, 2024
👋 Thank you for opening this pull request! We appreciate your contribution to improving this project. Your PR is under review, and we'll get back to you shortly.
Please review as soon as possible, as GGSOC ends today at 7 PM.
UppuluriKalyani
approved these changes
Nov 10, 2024
🎉🎉 Thank you for your contribution! Your PR #878 has been merged! 🎉🎉
Somyajain2004
added a commit
to Somyajain2004/Open-source-Practice
that referenced
this pull request
Nov 10, 2024
PR link: UppuluriKalyani/ML-Nexus#878 Hi GGSOC team, the PR mentioned above has been reviewed and approved by the reviewer, but my GGSOC leaderboard score hasn't been updated for this PR. Please look into it, as GGSOC ends at 7 PM today. @sanjay-kv @MastanSayyad
About the project:
I have built a text extractor using two methods: PyTesseract and EasyOCR. Both are powerful Python libraries used mainly for optical character recognition (OCR).
PyTesseract is used for fast computation, while EasyOCR comes in handy for extracting text even from noisy images.
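To illustrate the two approaches, here is a minimal sketch. It is not the code from this PR: the function names, the `ENGINES` table, and the default language list are assumptions made for illustration. It relies only on `pytesseract.image_to_string` and `easyocr.Reader(...).readtext`, and it assumes the Tesseract binary, Pillow, and both Python packages are installed.

```python
from typing import Callable, Dict


def extract_with_tesseract(image_path: str) -> str:
    """Extract text with Tesseract through the pytesseract wrapper."""
    # Imported lazily so this module still loads if only one engine is installed.
    from PIL import Image
    import pytesseract

    with Image.open(image_path) as image:
        return pytesseract.image_to_string(image).strip()


def extract_with_easyocr(image_path: str, languages=("en",)) -> str:
    """Extract text with EasyOCR (the language list is an assumption)."""
    import easyocr

    # Creating a Reader loads the recognition models, so reuse it when
    # processing many images instead of rebuilding it per call.
    reader = easyocr.Reader(list(languages))
    # detail=0 returns only the recognized strings, without boxes or scores.
    return "\n".join(reader.readtext(image_path, detail=0))


# Hypothetical dispatch table mapping an engine name to its extractor.
ENGINES: Dict[str, Callable[[str], str]] = {
    "tesseract": extract_with_tesseract,
    "easyocr": extract_with_easyocr,
}


def extract_text(image_path: str, engine: str = "tesseract") -> str:
    """Run the chosen OCR engine on one image and return its text."""
    try:
        extractor = ENGINES[engine]
    except KeyError:
        raise ValueError(
            f"Unknown engine {engine!r}; expected one of {sorted(ENGINES)}"
        ) from None
    return extractor(image_path)
```

A call such as `extract_text("scan.png", engine="easyocr")` would then run the slower but more noise-tolerant path, where `scan.png` is a placeholder file name.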
Screenshots
1. Text Extraction using PyTesseract:
- input:
- output:
- input:
(The input file is given in a different format here, as GitHub doesn't support .jfif.)
- output:
Additionally, I added functionality to convert the extracted text into PDF and TXT files for both types of extraction (i.e., EasyOCR and Tesseract).
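The export step could look roughly like the sketch below. The PR text does not say which PDF library is used, so the choice of `fpdf` (the `FPDF` class from the fpdf/fpdf2 package) is an assumption, as are the function names and the font settings. The TXT path uses only the standard library.

```python
from pathlib import Path


def save_as_txt(text: str, path: str) -> Path:
    """Write the extracted text to a UTF-8 encoded .txt file."""
    out = Path(path)
    out.write_text(text, encoding="utf-8")
    return out


def save_as_pdf(text: str, path: str) -> Path:
    """Write the extracted text to a simple one-column PDF.

    Assumption: the fpdf package is installed; the original project
    may use a different PDF library.
    """
    # Imported lazily so TXT export works without the PDF dependency.
    from fpdf import FPDF

    out = Path(path)
    pdf = FPDF()
    pdf.add_page()
    pdf.set_font("Helvetica", size=12)
    # The built-in fonts only cover Latin-1, so characters outside that
    # range are replaced with "?" here rather than raising an error.
    safe_text = text.encode("latin-1", "replace").decode("latin-1")
    # Width 0 means "extend to the right margin"; long lines are wrapped.
    pdf.multi_cell(0, 10, safe_text)
    pdf.output(str(out))
    return out
```

Because both helpers take a plain string, the same two functions can serve the EasyOCR and the Tesseract output without any engine-specific handling.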