Duplicate value for merged cell instead of None
#420
Labels
feature-request
All feature requests receive this label initially, can be upgraded to "enhancement"
None
#420
Hi there,
Firstly, I want to thank you for creating this awesome
pdfplumber
project. I have been using it a lot lately.Secondly, I'm facing a problem and want to propose change to the output of the table module regarding the utility
extract_tables
with merged cells.As it is now, only the first row of the merged cell has value, the remaining cells get
None
.It would be more semantically correct to have the value in merged cell duplicated like this
I'm preparing a PR, please consider it.
Thank you!
Update 1
Here's the PR: #422
Approach: finding the centroid of each "cell", and then get the cells from the centroids for each row.
Update 2:
I've encountered this case:
Expected output is:
None
means not a cell there, and it makes the number of elements is even between rows which is convenient for further processing.Note: the PR is updated
The text was updated successfully, but these errors were encountered: