doy / spreadsheet-parsexlsx

parse XLSX files
http://metacpan.org/release/Spreadsheet-ParseXLSX
27 stars 35 forks source link

added 2 new options to make ParseXLSX more efficient with Large Multi-tab Workbooks #98

Open cboleary opened 3 years ago

cboleary commented 3 years ago

This is an attempt to fix a performance issue with the way ParseXLSX handles large spreadsheets. If you use the ReadData method, it parses every worksheet in the workbook and this can consume alot of time especially if you just want to find out the worksheet tab names

Added option: --just_find_sheet_names If just_find_sheet_names is set, the code will quickly return the worksheetInfo without parsing each worksheet and --sheet_filter comma separated string of worksheet names to parse if sheet_filter is not set, the module operates as it did, parsing EVERY worksheet in the workbook if its set, the module will only parse the worksheets listed which can be way more efficient