I'm doing a join of two tables on a compound key of [int, int, int]. In newer versions of polars (inlcuding polars==1.4.1) it uses much more memory than I'd expect. I confirmed by rolling back to polars==0.19.19 and found it did use significantly less memory.
Expected behavior
I'd expect a left join for this problem to use something like 2x the space of the left table as it did in 0.19.19. I ran the script with python's memory-profiler package and used the commands
Checks
Reproducible example
I installed memo
Log output
Issue description
I'm doing a join of two tables on a compound key of [int, int, int]. In newer versions of polars (inlcuding
polars==1.4.1
) it uses much more memory than I'd expect. I confirmed by rolling back topolars==0.19.19
and found it did use significantly less memory.Expected behavior
I'd expect a left join for this problem to use something like 2x the space of the left table as it did in
0.19.19
. I ran the script with python'smemory-profiler
package and used the commandsRunning:
mprof run --python -o polars_141_small.prof -M --include-children python polars_join_bug_mwe.py
Plotting:mprof plot polars_141_small.prof --title polars_1_4_1 -o polars_141_small.png -w 0,12
Installed versions