C2SM / iconarray

R&D Python package for processing ICON data with xarray
https://c2sm.github.io/iconarray/
BSD 3-Clause "New" or "Revised" License
10 stars 2 forks source link

Read-in Performance #35

Open regDaniel opened 1 year ago

regDaniel commented 1 year ago

This issue is more a documentation for us. We try to optimize the read-in with @clairemerker

Some timings:

more timings with Dask (open 10 icon forecast files and extract one variable):

when first merging files with cat:

All timings were tested on Tsa reading from /store, reading from /scratch reduces read-in times by approximately 10%.

regDaniel commented 1 year ago

I think, we gained some more experience with this during the development of icon_timeseries. Can we close this one @clairemerker or do you think it is still relevant for iconarray? If yes, I should probably update the timings.

clairemerker commented 1 year ago

In a sense the issue is still relevant, @victoria-cherkas and I will write a new version of open_dataset() for iconarray based on what we learned in icon-timeseries. No need to update the timings in my opinion, but maybe keep the issue open, we can close it after the new implementation.