added 2 new options to make ParseXLSX more efficient with Large Multi-tab Workbooks #98

cboleary · 2021-04-02T21:58:41Z

This is an attempt to fix a performance issue with the way ParseXLSX handles large spreadsheets.
If you use the ReadData method, it parses every worksheet in the workbook and this can consume alot of time
especially if you just want to find out the worksheet tab names

Added option:
--just_find_sheet_names
If just_find_sheet_names is set, the code will quickly return the worksheetInfo without parsing each worksheet
and
--sheet_filter comma separated string of worksheet names to parse
if sheet_filter is not set, the module operates as it did, parsing EVERY worksheet in the workbook
if its set, the module will only parse the worksheets listed which can be way more efficient

--just_find_sheet_names If just_find_sheet_names is set, the code will quickly return the worksheetInfo without parsing each worksheet and --sheet_filter comma separated string of worksheet names to parse if sheet_filter is not set, the module operates as it did, parsing EVERY worksheet in the workbook if its set, the module will only parse the worksheets listed which can be way more efficient

…tested) I needed to return a shell of a WorkSheet with an empty list for the Cells Attribute

cboleary mentioned this pull request Apr 3, 2021

Need a low overhead way to get worksheetNames and specify which sheetNames to process #99

Open

cboleary added 3 commits April 3, 2021 06:32

fixed just_find_sheet_names operation (the orginal was way off and un…

1391328

…tested) I needed to return a shell of a WorkSheet with an empty list for the Cells Attribute

ignore my emacs backups

eef465c

fixed sheet_filter creation

741de7f

cboleary mentioned this pull request Apr 15, 2022

*** Maintenance moved to r-gregmisc/spreadsheet-parsexlsx *** #97

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

added 2 new options to make ParseXLSX more efficient with Large Multi-tab Workbooks #98

added 2 new options to make ParseXLSX more efficient with Large Multi-tab Workbooks #98

cboleary commented Apr 2, 2021

added 2 new options to make ParseXLSX more efficient with Large Multi-tab Workbooks #98

Are you sure you want to change the base?

added 2 new options to make ParseXLSX more efficient with Large Multi-tab Workbooks #98

Conversation

cboleary commented Apr 2, 2021