Closed lognaturel closed 2 years ago
openpyxl
read only mode appears to bring mem usage back to v1.7 levels. Hurray for an easy fix. 😮💨
Looks like it's a specific form running into something like what this post describes: https://stackoverflow.com/questions/47582274/iterate-through-columns-in-read-only-workbook-in-openpyxl
There are memory gains to using read-only mode so we should still do it but this is not as critical as it originally seemed.
For whatever reason, the specific form this was seen on had a huge number of extra columns. Deleting those makes the form convert quickly with minimal memory usage with v1.8.0. I still would like to do #596 but it's not critical. CC @aurdipas
More users have been reporting this so grateful for your help getting it addressed, @lindsay-stevens 🚀
A 200kb Excel doc was measured using 1.5GB of memory.
The most likely culprit is
openpyxl
, introduced in #575 by @sheppardFrom https://openpyxl.readthedocs.io/en/stable/performance.html:
What we're seeing is higher by two orders of magnitude.