Closed kaczmarj closed 4 months ago
@kaczmarj I could take this issue up! What would be the best chunk size for our case? I was looking at this StackOverflow answer regarding chunksizes. Going through some more documentation to see what will be best for our case
great! thanks for the link to the stack overflow answer. according to that, inter-process communication time is an important consideration when choosing the chunk size. the thing is, the duration of the conversion greatly outweighs the duration of process switching.
i wonder if chunksize=1
would be best actually...
could you test a few and see which gives the best timing results? perhaps test 1, 4, 10, and 20.
Sure. I will run some tests and will see what gives the best results.
I tried out different chunksizes. Here is what I observed:
Exp 1
:
Time elapsed
= 7.646886587142944Exp 2
:
Time elapsed
= 20.596678256988525Exp 3
:
Time elapsed
= 42.19792675971985Exp 4
:
Time elapsed
= 46.71947002410889I believe using chunksize = 1 seems to be the best option.
thanks @swaradgat19