Two different words are getting merged while extracting text from PDF #800
sarveshkrg
started this conversation in
Ask for help with specific PDFs
Replies: 1 comment
-
Hi @sarveshkrg, and thanks for your interest in this library. Have you tried adjusting the |
0 replies
-
Two different words are getting merged while extracting text from PDF.
Example
The actual text is 'the full', but reading it with the extract_text method returns 'thefull'.
Code.
import pdfplumber

with pdfplumber.open("stayveer-epar-product-information_en.pdf") as pdf:
    page = pdf.pages[1]  # second page (pages are zero-indexed)
    print(page.extract_text())
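One thing that may be worth checking here is the horizontal tolerance used when characters are grouped into words. pdfplumber's extract_text accepts an x_tolerance keyword (default 3), and a smaller value can keep narrowly spaced words apart. The sketch below is a simplified toy model of that idea, not pdfplumber's actual implementation: join_chars, extract_with_tolerance, the character boxes, and the value 1 are all assumptions made for illustration, and the right value for this PDF would need to be found by trial.

```python
def join_chars(chars, x_tolerance=3):
    """Toy model: add a space when the horizontal gap between
    consecutive characters is larger than x_tolerance."""
    out = []
    prev_x1 = None
    for text, x0, x1 in chars:
        if prev_x1 is not None and x0 - prev_x1 > x_tolerance:
            out.append(" ")
        out.append(text)
        prev_x1 = x1
    return "".join(out)


# Hypothetical (text, x0, x1) boxes for "the full": the gap between
# the two words is only 2 units, which is below the default of 3.
chars = [
    ("t", 0, 4), ("h", 4, 8), ("e", 8, 12),
    ("f", 14, 18), ("u", 18, 22), ("l", 22, 26), ("l", 26, 30),
]

print(join_chars(chars, x_tolerance=3))  # thefull
print(join_chars(chars, x_tolerance=1))  # the full


def extract_with_tolerance(path, page_index=1, x_tolerance=1):
    """Hypothetical helper: run the same extraction as in the question,
    but with a tighter x_tolerance passed to extract_text."""
    import pdfplumber  # imported here so the toy model above runs without it

    with pdfplumber.open(path) as pdf:
        return pdf.pages[page_index].extract_text(x_tolerance=x_tolerance)
```

If the words are still merged at a lower tolerance, the space may simply not exist as a measurable gap in the PDF, in which case this setting alone would not help.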
PDF screenshot
PDF file used
stayveer-epar-product-information_en.pdf