documaster / noark-extraction-validator

GNU Affero General Public License v3.0

Add support for large excel reports generation #52

Closed IvanPenevDocumaster closed 4 years ago

IvanPenevDocumaster commented 4 years ago

This is done using an SXSSFWorkbook, which automatically writes its contents to disk once a certain threshold is reached. It replaces the old XSSFWorkbook, which can still be used with the '-old-xlsx-report-generation' flag in case of any unforeseen issues.
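For readers unfamiliar with the streaming API, here is a minimal sketch of how an SXSSFWorkbook from Apache POI is typically used. It is not the code from this PR: the class name `LargeReportSketch`, the method `writeReport`, the window size of 100 rows, and the cell contents are all assumptions made for illustration.

```java
import org.apache.poi.ss.usermodel.Row;
import org.apache.poi.ss.usermodel.Sheet;
import org.apache.poi.xssf.streaming.SXSSFWorkbook;

import java.io.IOException;
import java.io.OutputStream;
import java.nio.file.Files;
import java.nio.file.Path;

public class LargeReportSketch {

    // Assumed value: number of rows kept in memory before older rows
    // are flushed to a temporary file on disk.
    static final int ROW_WINDOW = 100;

    // Hypothetical helper: writes rowCount simple rows to an .xlsx file.
    static void writeReport(Path target, int rowCount) throws IOException {
        SXSSFWorkbook workbook = new SXSSFWorkbook(ROW_WINDOW);
        try (OutputStream out = Files.newOutputStream(target)) {
            Sheet sheet = workbook.createSheet("report");
            for (int i = 0; i < rowCount; i++) {
                // Rows that fall outside the window are flushed to disk
                // and can no longer be accessed or modified.
                Row row = sheet.createRow(i);
                row.createCell(0).setCellValue(i);
                row.createCell(1).setCellValue("entry " + i);
            }
            workbook.write(out);
        } finally {
            // Remove the temporary files that back the flushed rows.
            workbook.dispose();
            workbook.close();
        }
    }

    public static void main(String[] args) throws IOException {
        Path target = Files.createTempFile("report-sketch", ".xlsx");
        writeReport(target, 100_000);
        System.out.println("Wrote " + target);
    }
}
```

The trade-off of this approach is that flushed rows are write-once, so any report logic that needs to revisit earlier rows would have to be restructured or keep using XSSFWorkbook.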

Support for an HSQL 'file' database is also added. It runs roughly 3 times slower than the 'in memory' one but can handle large amounts of data. By default its contents are stored in the OS temp directory, but a different location can be specified explicitly.
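The difference between the two database modes comes down to the HSQLDB JDBC URL. The sketch below shows one way the URL could be chosen, falling back to the OS temp directory when no location is given. The class `HsqlUrlSketch`, its method names, and the database name `noark` are hypothetical and are not taken from this PR. Only the `jdbc:hsqldb:mem:` and `jdbc:hsqldb:file:` prefixes are standard HSQLDB.

```java
import java.nio.file.Path;
import java.nio.file.Paths;

public class HsqlUrlSketch {

    // URL for an in-memory database: fast, but limited by available heap.
    static String inMemoryUrl(String dbName) {
        return "jdbc:hsqldb:mem:" + dbName;
    }

    // URL for a file-backed database. If no directory is given, the
    // OS temp directory is used (assumed default, mirroring the description).
    static String fileUrl(String directory, String dbName) {
        String base = (directory == null || directory.isEmpty())
                ? System.getProperty("java.io.tmpdir")
                : directory;
        Path dbPath = Paths.get(base, dbName);
        // Use forward slashes so the URL looks the same on every platform.
        return "jdbc:hsqldb:file:" + dbPath.toString().replace('\\', '/');
    }

    public static void main(String[] args) {
        System.out.println(inMemoryUrl("noark"));
        System.out.println(fileUrl(null, "noark"));
        System.out.println(fileUrl("/data/db", "noark"));
    }
}
```

Either URL can then be passed to `DriverManager.getConnection` with the HSQLDB driver on the classpath.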