Pdfplumber misses first column and last row for all tables within a schematic #544
PabloABCD
started this conversation in
Ask for help with specific PDFs
Replies: 1 comment 1 reply
-
Hi @PabloABCD Appreciate your interest in the library. Using the following table settings worked for me {
"vertical_strategy": "explicit",
"horizontal_strategy": "explicit",
"explicit_vertical_lines": page.curves+page.edges,
"explicit_horizontal_lines": page.curves+page.edges,
"intersection_tolerance": 15,
}
|
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
PdfPlumber does not extract the first column and the last row of every table in document. I have tried to tweak several configuration parameters in table_settings variable, unluckily I haven't been able to achieve any better result (in my case, the rest of the chars in the schematic is considered as a table in case I use "text" instead of "lines").
Any help with this? I am using Python 3.9.8 and the pdf for testing can be found in: schematic.pdf
The source code for extracting first page is:
Thank you a lot for your help and your impressive library.
Beta Was this translation helpful? Give feedback.
All reactions