exact table extract #640
Larbo53
started this conversation in
Ask for help with specific PDFs
Replies: 1 comment
-
after several tries, I found the solution. Sincerely vertical=(83,170,264,355,420,500) |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hello,
I would like to extract the fields of the table without cutting off the words "Value date" and "Operation date", but also to group all the information linked to a date.
here is the command used :
text = p0.extract_table(table_settings={ "vertical_strategy": "text", "snap_x_tolerance": 40})
the result for the first lines:
[['Value date', 'ur', 'Operation date', 'ation', 'Label', 'Debit', 'Credit', ''], ['31-03-2022', '', '31-03-2022', '', 'SCT REGULATIONNAPETIZ AFFILIES 2. \nNIT-TRD-2027320-\n29032022-01', '', '11,87', ''], ['31-03-2022', '', '31-03-2022', '', 'SCT REGULATIONnAPETIZ AFFILIES 2. \nNIT-TRD-2027320-\n30032022-01', '', '3.29', ''], ['31-03-2022', '', '31-03-2022', '', 'SUBSCRIPTION CGD\nON LINE N4', '-20.00', '', ''],
Thanks for your help.
Sincerely
Translated with www.DeepL.com/Translator (free version)
releve_page1.pdf
Beta Was this translation helpful? Give feedback.
All reactions