Fails to get the last row from tables #742
sksoumik started this conversation in Ask for help with specific PDFs
Hi @sksoumik, appreciate your interest in the library. For this particular PDF, you can use the following table settings:

    {
        "vertical_strategy": "lines_strict",
        "horizontal_strategy": "explicit",
        "explicit_horizontal_lines": [edge for edge in page.curves + page.edges if edge["height"] > 15 or edge["width"] > 15],
    }

The output will be
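For anyone applying the settings above to a whole document, here is a minimal sketch of how they might be wired into a script. The helper names (`long_edges`, `build_table_settings`, `extract_all_tables`) and the `min_size` parameter are hypothetical and only illustrate the idea. The threshold of 15 comes from the reply above and may need tuning for other PDFs.

```python
from typing import Any, Dict, List


def long_edges(page: Any, min_size: float = 15) -> List[Dict[str, Any]]:
    """Return the page's curves and edges whose height or width exceeds min_size.

    `page` is expected to behave like a pdfplumber Page, i.e. to expose
    `.curves` and `.edges` as lists of dicts with "height" and "width" keys.
    """
    return [
        edge
        for edge in page.curves + page.edges
        if edge["height"] > min_size or edge["width"] > min_size
    ]


def build_table_settings(page: Any, min_size: float = 15) -> Dict[str, Any]:
    """Build the per-page table settings suggested in the reply (hypothetical helper)."""
    return {
        "vertical_strategy": "lines_strict",
        "horizontal_strategy": "explicit",
        "explicit_horizontal_lines": long_edges(page, min_size),
    }


def extract_all_tables(pdf_path: str) -> List[List[List[Any]]]:
    """Extract every table from every page, rebuilding the settings per page."""
    import pdfplumber  # imported here so the helpers above work without it installed

    tables: List[List[List[Any]]] = []
    with pdfplumber.open(pdf_path) as pdf:
        for page in pdf.pages:
            # The explicit lines depend on each page's own geometry.
            tables.extend(page.extract_tables(build_table_settings(page)))
    return tables
```

The settings are rebuilt for each page because `explicit_horizontal_lines` is derived from that page's own curves and edges, so a single dict cannot be reused across pages.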
With this PDF, I am trying to extract all the tables. I have tried different settings, but I can't make it work: the last row of each table is always missing. Am I missing something? I have seen this discussion, but it doesn't work for me.
Sample code I tried:
It always fails to get the last row from all tables.
Output that I get: