Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

added 2 new options to make ParseXLSX more efficient with Large Multi-tab Workbooks #98

Open
wants to merge 4 commits into
base: master
Choose a base branch
from

Conversation

cboleary
Copy link

@cboleary cboleary commented Apr 2, 2021

This is an attempt to fix a performance issue with the way ParseXLSX handles large spreadsheets.
If you use the ReadData method, it parses every worksheet in the workbook and this can consume alot of time
especially if you just want to find out the worksheet tab names

Added option:
--just_find_sheet_names
If just_find_sheet_names is set, the code will quickly return the worksheetInfo without parsing each worksheet
and
--sheet_filter comma separated string of worksheet names to parse
if sheet_filter is not set, the module operates as it did, parsing EVERY worksheet in the workbook
if its set, the module will only parse the worksheets listed which can be way more efficient

--just_find_sheet_names
If just_find_sheet_names is set, the code will quickly return the worksheetInfo without parsing each worksheet
and
--sheet_filter comma separated string of worksheet names to parse
if sheet_filter is not set, the module operates as it did, parsing EVERY worksheet in the workbook
if its set, the module will only parse the worksheets listed which can be way more efficient
…tested)

I needed to return a shell of a WorkSheet with an empty list for the Cells Attribute
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant