Not detecting some vertical lines #836
RyanAquino started this conversation in Ask for help with specific PDFs
-
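Hi @RyanAquino, I appreciate your interest in the library. To get the missing lines, you can switch to explicit vertical lines and pass in the page's curves and edges, plus the leftmost and rightmost x-coordinates of the horizontal edges as outer boundaries:
page = pdf.pages[1]
# Leftmost and rightmost x-coordinates covered by the horizontal edges
start = min(x['x0'] for x in page.horizontal_edges)
end = max(x['x1'] for x in page.horizontal_edges)
table_settings = {
    "vertical_strategy": "explicit",
    "horizontal_strategy": "text",
    "intersection_x_tolerance": 10,
    # Curves and edges found on the page, plus the two outer boundaries
    "explicit_vertical_lines": page.curves + page.edges + [start, end],
}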
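The reply stops at building `table_settings`. Here is a minimal sketch of how those settings might be wrapped and then passed to extraction. The helper names `build_table_settings` and `extract_tables_with_explicit_lines` and the sample edge values are hypothetical and used only for illustration. `page_index=1` mirrors `pdf.pages[1]` from the snippet above.

```python
def build_table_settings(horizontal_edges, extra_vertical_lines=()):
    """Build the settings dict from the reply (hypothetical helper).

    horizontal_edges: dicts with "x0" and "x1" keys, e.g. page.horizontal_edges.
    extra_vertical_lines: further explicit lines, e.g. page.curves + page.edges.
    """
    if not horizontal_edges:
        raise ValueError("no horizontal edges found; cannot infer table bounds")
    # Outer boundaries: leftmost and rightmost x covered by horizontal edges
    start = min(e["x0"] for e in horizontal_edges)
    end = max(e["x1"] for e in horizontal_edges)
    return {
        "vertical_strategy": "explicit",
        "horizontal_strategy": "text",
        "intersection_x_tolerance": 10,
        "explicit_vertical_lines": list(extra_vertical_lines) + [start, end],
    }


def extract_tables_with_explicit_lines(pdf_path, page_index=1):
    """Open a PDF and extract tables from one page using the settings above."""
    import pdfplumber  # imported lazily so build_table_settings works without it

    with pdfplumber.open(pdf_path) as pdf:
        page = pdf.pages[page_index]
        settings = build_table_settings(
            page.horizontal_edges, page.curves + page.edges
        )
        return page.extract_tables(table_settings=settings)


# Stand-in data (made up) to show the shape of the resulting settings
sample_edges = [{"x0": 36.0, "x1": 560.0}, {"x0": 40.5, "x1": 575.25}]
settings = build_table_settings(sample_edges)
```

With a real file, `extract_tables_with_explicit_lines("05122000.pdf")` would return a list of tables, each a list of rows. Whether the result matches the expected layout still depends on the PDF, so the tolerance value may need tuning.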
-
Hello everyone,
I am not sure whether I have the proper table settings, but I am trying to retrieve all the data and format it so that I can determine which table each extracted value came from.
The current issue is that some vertical lines are not being detected properly, even though my table settings already use vertical_strategy: lines.
PDF file: 05122000.pdf
Lines that were not detected