Closed ritchie46 closed 3 years ago
unfortunately it didn't help for G1_1e9_1e2_0_0, rest is still running
Hmm.. :slightly_frowning_face: Again killed before any question was executed?
yes, full output is
# groupby-polars.py
loading dataset G1_1e9_1e2_0_0
Killed
Thanks. Back to the drawing board.
This PR sets `low_memory` to `True` while parsing the csv. Furthermore, we shrink the arrays after we have coerced to `Categorical`, and we make sure that the global string cache is emptied when it is no longer needed. Hopefully, this solves the problem when loading the 50GB dataset.