Formatting of extracted text with indentation #442
TheEyesChico
started this conversation in
Ask for help with specific PDFs
Replies: 1 comment
-
Hi @TheEyesChico Appreciate your interest in the library. This functionality is not yet supported but I can see how it will be useful to you and many others. There's already an open issue #10 for tracking this. Request you to follow that. You may also contribute to the development if you have a solution. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hey guys, finding your utility very useful so kudos to that!
Okay so I'm working on a high level project where I need to put the data from pdf inside dataframes division by division (for extraction purpose). Indentation in this process becomes important because if I'm searching a particular keyword in a document, I need to back-track the indentation from sub sections to top sections.
PDF Data
However the output I get is very structured and indentation usually isn't taken care of.
Output
Is there any way I can get around this problem?
Beta Was this translation helpful? Give feedback.
All reactions