Skip to content
This repository has been archived by the owner on Jan 20, 2021. It is now read-only.

Comments column not extracted #105

Open
CMCDragonkai opened this issue Jan 3, 2016 · 3 comments
Open

Comments column not extracted #105

CMCDragonkai opened this issue Jan 3, 2016 · 3 comments

Comments

@CMCDragonkai
Copy link

This worked quite well for all the columns and rows, but for some reason the comments column wasn't extracted (it's all text of course). It looked like this:

screenshot at 04-23-11

@jeremybmerrill
Copy link
Member

@CMCDragonkai, there's a known issue (related to #99) where the detected selection area is just a little bit too small. If you expand the selection on the right just a little bit -- so it includes the border, not just the text -- then the comments column should be extracted too.

@CMCDragonkai
Copy link
Author

I see, does is the bounding box detection based on image processing or something else? Surely there must be a way to match what is contained.

@jeremybmerrill
Copy link
Member

The bounding box detection -- for PDFs that support it -- uses the line elements in the PDF. So it totally should work, I just think we're at a bit of an impasse about the solution.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants