pangeo-data / pangeo

Pangeo website + discussion of general issues related to the project.
http://pangeo.io
699 stars 188 forks source link

Use Case Notebooks: COSIMA Cookbook #13

Closed jmunroe closed 6 years ago

jmunroe commented 6 years ago

I've been developing a set of notebooks to analyze fairly large MOM5 ocean model output for at 1, 0.25, and 0.1 degree resolutions. Under the hood, it is primarily using xarray/dask with the intention of building upon what is to be developed from within the Pangeo project. I'll put a link here in case it is of use to others:

COSIMA Cookbook

Most members of COSIMA (Consortium for Ocean-Sea Ice Modelling in Australia) use the Australian National Computing Infrastructure (NCI) on a machine called Raijin. One thing that I am doing is experimenting with some of the advances being made within xarray/dask that are currently being used on Cheyenne and applying them to Raijin as well.

Would there be value in developing a stand-alone 'use case notebook' that for Pangeo that shows analysis of MOM5 model output?

mrocklin commented 6 years ago

Would there be value in developing a stand-alone 'use case notebook' that for Pangeo that shows analysis of MOM5 model output?

I suspect that the answer here is "definitely", but I'll let the science folks handle this question.

One thing that I am doing is experimenting with some of the advances being made within xarray/dask that are currently being used on Cheyenne and applying them to Raijin as well.

I'm very glad to hear this. Please be vocal about anything that you run into. To be clear our goal is to improve the experience for the entire community, not just users of Cheyenne.

rabernat commented 6 years ago

@jmunroe thanks so much for your contributions! It looks like you are doing some great work. Your COSIMA cookbook is awesome. I'm personally really interested in applying diagnostics like this to my own modeling studies.

Would there be value in developing a stand-alone 'use case notebook' that for Pangeo that shows analysis of MOM5 model output?

Absolutely. One challenge might be that you appear to have some dependencies on non-standard packages. (That's not a dealbreaker.) Your kinetic energy notebook could be a good candidate, since it's a relatively simple and very common diagnostic for eddy-resolving ocean models. I think there are some general-use needs that are identified here, such as the "box index" approach you are using to coarse graining. (FWIW, I'm almost certain that the best way to do this sort of thing is via data shader.)

Here is a list of requirements we developed for the other use cases:

Although we don't have MOM5 output on Cheyenne, we do have high-resolution POP output, which should in principle be quite similar in terms of processing needs. Depending on the amount of data, we could also consider transferring over a sample of the data for benchmarking purposes.

stale[bot] commented 6 years ago

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

stale[bot] commented 6 years ago

This issue has been automatically closed because it had not seen recent activity. The issue can always be reopened at a later date.