Open sagarchotalia opened 2 years ago
Saw your notebook, great! https://github.com/sagarchotalia/radis-benchmark/blob/master/manual_benchmarks/chunksize_benchmark.ipynb
Can you somehow output the number of chunks ? I see there are only ~50,000 lines in the NO example, which is a small number, so it's possible that beyond chunksize=1e5 we're having all lines in a single chunk.
Yes, so I included the "N" calculation parameter, here are the results:
NO
, the temporary chunk dataframe dg
has a size of 1 until chunksize = 1e6
, since the N parameter gets larger than df1.chunksize = 1e6
, the sizes of the chunks in memory are:
Added a notebook for benchmarking of chunksize vs. non-chunksize computations, plotting the results.
Changes Made
NO
equilibrium spectrum, with ~53k lines.Future Objectives
chunksize = "auto"
feature, once it is implemented (see the Chunksize PR)