Open karond-is-me opened 10 months ago
I tried this on Linux, with /usr/bin/time -v python script.py
to measure max resident memory, with version on main
from Jan 17 2024. Was unable to see any memory usage difference between runs with and without separator, albeit with a different file than the one reporter used.
After updating Polars to version 0.20.5, I noticed no discernible changes on my Windows computer.
Checks
Reproducible example
Log output
No response
Issue description
When attempting to read a large 18GB CSV file using streaming or batched reading methods, setting the separator parameter to a non-default value might lead to a memory explosion, despite this phenomenon not being reflected in the Windows process manager. Additionally, I speculate that bug #9266 may be related to this issue.
Expected behavior
To avoid loading the entire content into memory, you can utilize streaming or batched reading methods instead.
Installed versions