PDF returns no text, tables, or anything #717
toakleyy started this conversation in Ask for help with specific PDFs
Replies: 1 comment 2 replies
-
Hi @toakleyy, and thanks for your interest in this library. From what you've shared, it appears that your PDF is a scanned document, not a born-digital PDF. (A quick way to test this: Can you select / copy / paste any text within the document?) If that's the case, you'll first want to run optical character recognition on the PDF. Then you can extract the OCR-detected text with |
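The select / copy / paste test can also be approximated in code. The sketch below is only a heuristic, not a definitive check: it assumes that a page with no character objects but at least one embedded image is probably a scan. `looks_scanned` and `report` are hypothetical helper names made up for this illustration; `pdfplumber.open`, `page.chars`, `page.images`, and `page.page_number` are the library's own.

```python
def looks_scanned(char_count, image_count):
    """Heuristic (an assumption, not a guarantee): a page with no text
    characters but at least one image is probably a scanned page."""
    return char_count == 0 and image_count > 0


def report(path):
    """Return one (page_number, n_chars, n_images, probably_scanned) tuple per page."""
    import pdfplumber  # imported here so the heuristic above can be used without it

    rows = []
    with pdfplumber.open(path) as pdf:
        for page in pdf.pages:
            n_chars = len(page.chars)    # character objects in the text layer
            n_images = len(page.images)  # embedded images on the page
            rows.append(
                (page.page_number, n_chars, n_images, looks_scanned(n_chars, n_images))
            )
    return rows
```

Calling `report("RJ PDF markup.pdf")` and printing each tuple would show whether any page has a text layer at all. If every page reports zero characters, the text has to come from an OCR step first (OCRmyPDF is one commonly used tool for adding a text layer), since there is nothing for text extraction to read.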
-
I have used pdfplumber with success on a separate PDF, but when I try the same function on this new PDF I get absolutely nothing. I'm simply trying to extract text. See the attached PDF (I have blocked out personal info in this copy, but it is not blocked out in the file I am actually trying to plumb):
RJ PDF markup.pdf
Some code I am using that isn't working:
`
My output looks like this:
page 0
page 1
page 2
So it scans each page, but doesn't actually come up with any text.
Any help would be much appreciated. Thank you.