False Positive for table extraction #524
pranjal-jaiswal started this conversation in "Ask for help with specific PDFs"
Hi @pranjal-jaiswal, I appreciate your interest in the library. If you use the table debugger, you'll notice that the page contains hidden horizontal and vertical lines, as you can see in the below output. To remove them, you can refer to the code at #285 (comment).
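For readers who cannot open the linked comment, here is a rough sketch of the general idea. It is not the code from #285. It assumes the hidden lines are drawn in white, which may not hold for this PDF. `is_white`, `is_visible`, `save_debug_image`, and `find_visible_tables` are hypothetical helper names. `Page.filter`, `Page.to_image`, `debug_tablefinder`, and `find_tables` are existing pdfplumber calls.

```python
def is_white(color):
    """True if a gray or RGB color value is pure white (assumed heuristic)."""
    if color is None:
        return False
    comps = color if isinstance(color, (tuple, list)) else (color,)
    return len(comps) in (1, 3) and all(c == 1 for c in comps)


def is_visible(obj):
    """Keep every object except lines/rects/curves painted only in white."""
    if obj.get("object_type") not in ("line", "rect", "curve"):
        return True  # chars, images, etc. are always kept
    colors = [obj.get("stroking_color"), obj.get("non_stroking_color")]
    colors = [c for c in colors if c is not None]
    # Assumption: an object whose every paint color is white is "hidden".
    return not (colors and all(is_white(c) for c in colors))


def save_debug_image(path, out_path="debug.png", page_index=0):
    """Render the page with the table finder's detected lines overlaid."""
    import pdfplumber  # imported lazily so the helpers above stay standalone

    with pdfplumber.open(path) as pdf:
        image = pdf.pages[page_index].to_image()
        image.debug_tablefinder().save(out_path)


def find_visible_tables(path, page_index=0):
    """Run table detection after dropping objects that look hidden."""
    import pdfplumber

    with pdfplumber.open(path) as pdf:
        page = pdf.pages[page_index].filter(is_visible)
        return [table.extract() for table in page.find_tables()]
```

Calling `save_debug_image("2.pdf")` first should show which lines the table finder is picking up. If those lines are not white, for example if they are zero-size or clipped, the predicate in `is_visible` would need a different test.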
I am getting a table object on the first page of a PDF, even though there is not a single vertical or horizontal line on that page.
I have attached the code snippet and the result for your reference:
Code Snippet:
import pdfplumber
my_pdf = pdfplumber.open("2.pdf")
tables = my_pdf.pages[0].find_tables()
tables[0].extract()
Result:
[["Here's my SSN: 608-18-8954. I have been routing money 6227-8231-4. Ben is from"],
['Action Auto. congratulations on passing your driving test 6940579.. need to see last'],
['10 transaction of card 4539254270870707. male social security number is'],
['768-52-0198. Borbála is very reliable. You can always depend on him.. can you tell'],
['me your medical license number? QO5772175.. Zoolander is a 2001 American'],
['action-comedy film directed by Apolonia Kamińska and starring Kamińska. my name'],
['on the bank account 380106394 is wrong. what do i have to do to get it corrected?'],
['']]
I have also attached the pdf for reference:
2.pdf