cedadev / cmip6-object-store

CMIP6 Object Store Library
BSD 3-Clause "New" or "Revised" License

Run comparison of different retrieval patterns from object store #35

Open agstephens opened 4 years ago

agstephens commented 4 years ago

@mjones07 please describe the plan for this work here and tag @agstephens and @philipkershaw for comment.

mjones07 commented 4 years ago

@philipkershaw @agstephens the plan for this is to:

Read from the 3 different file types

Have 3 (maybe more) read patterns

Each read is timed, and the results are saved via pandas to a JSON file with the current headers:

Current issues are:

agstephens commented 4 years ago

@mjones07: thanks for providing details. This all looks good. I agree that combining (b) and (d) should increase the robustness of the tests.

Another potential issue that might interfere is:

I suspect that many repeats are the best option for reducing the noise that might be created by contention with others who are accessing Caringo.
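As a rough illustration of the "many repeats" idea, the sketch below times a read several times and keeps the median, which is less sensitive to the occasional slow run caused by contention. Everything here is hypothetical: `time_read`, `summarise`, `fake_read`, and the record keys are made-up names, not the headers or functions used in the actual test code.

```python
import json
import statistics
import time


def time_read(read_fn, repeats=5):
    """Call read_fn() `repeats` times; return wall-clock durations in seconds."""
    durations = []
    for _ in range(repeats):
        start = time.perf_counter()
        read_fn()
        durations.append(time.perf_counter() - start)
    return durations


def summarise(label, durations):
    """Reduce repeated timings to one record (keys are hypothetical headers)."""
    return {
        "label": label,
        "repeats": len(durations),
        "median_s": statistics.median(durations),  # damps contention outliers
        "min_s": min(durations),
        "max_s": max(durations),
    }


def fake_read():
    # Stand-in for a real object-store read.
    sum(range(10_000))


record = summarise("fake_read", time_read(fake_read, repeats=7))
results_json = json.dumps([record], indent=2)
print(results_json)
# With pandas installed, the same records could be written with:
#   pandas.DataFrame([record]).to_json("results.json", orient="records")
```

Keeping the minimum and maximum alongside the median gives a quick sense of how noisy a given set of repeats was.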

mjones07 commented 4 years ago

The testing code is done, and map and timeseries reads are in progress for S3netCDF4, netCDF4 (byte range), and xarray/zarr.
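For readers unfamiliar with the two patterns, here is a toy sketch of what "map" and "timeseries" reads usually mean for a (time, lat, lon) array. This assumes the common interpretation (a map is all grid points at one time step, a timeseries is one grid point across all time steps). The array is a small in-memory nested list, and `read_map` and `read_timeseries` are hypothetical helpers, not the test code itself.

```python
# Toy (time, lat, lon) array held as nested lists; a real test would
# open a remote dataset instead.
N_TIME, N_LAT, N_LON = 4, 3, 5
data = [
    [[t * 100 + y * 10 + x for x in range(N_LON)] for y in range(N_LAT)]
    for t in range(N_TIME)
]


def read_map(arr, t):
    """Map read: every grid point at a single time step."""
    return [row[:] for row in arr[t]]


def read_timeseries(arr, y, x):
    """Timeseries read: a single grid point across all time steps."""
    return [arr[t][y][x] for t in range(len(arr))]


map_slice = read_map(data, t=2)
series = read_timeseries(data, y=1, x=3)
print(map_slice[0])  # first latitude row at t=2
print(series)        # one value per time step
```

The two patterns stress storage differently: a map read touches one contiguous slab, whereas a timeseries read touches a small piece of every time step, so chunking choices can favour one over the other.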

philipkershaw commented 4 years ago

@mjones07, from a conversation with Pangeo people, there's a newer version of xarray/zarr that integrates an update to fsspec that uses asyncio. This could change the results significantly. I don't know how far you've got with the tests and whether it would be possible to investigate this new release.