Closed gillins closed 6 months ago
Hi @gillins
Yes, I think this is was I understood from the presentation.
Will be interesting to see what difference it makes.
@petebunting just to close the loop on this. Turns out setting paged aggregation really increases the file size:
Size of file | |
---|---|
Current kealib | 736m |
These Changes | 715m |
This plus pageSize=2048 | 952m |
This plus pageSize=4096 | 1.7G |
This plus pageSize=8192 | 3.9G |
So I think it's worth changing the chunksize (all that this PR now does) but let's not bother with the paged aggregation. It increases the filesize by quite a lot (probably because we have lots of small datasets in a KEA file - ie each block) plus breaks the API.
With just changing the chunk size we do get smaller files.
What do you think?
Hi @gillins
Yes, that sounds like a good way forward. I don't think we want file sizes growing that significantly.
@petebunting is this what you understood from that presentation?
I haven't done any testing with it yet....
cc: @neilflood