pydata / xarray

N-D labeled arrays and datasets in Python
https://xarray.dev
Apache License 2.0
3.57k stars 1.07k forks source link

Zarr Python 3 tracking issue #9515

Open jhamman opened 1 week ago

jhamman commented 1 week ago

What is your issue?

Zarr-Python 3.0 is getting close to a full release. This issue tracks the integration of the 3.0 release with Xarray.

Here's a running list of issues we're solving upstream related to integration with Xarray:

Special shout out to @TomAugspurger has been front running a lot of this 🙌.

TomAugspurger commented 1 week ago

High Level Plan

We can think about a couple lines of related work:

  1. Get xarray working with zarr-python 3.x (read / write Zarr v2 data)
  2. Support Zarr v3

I think that supporting zarr-python 3.x is the primary goal for now.

Work Items

These PRs are needed on top of zarr-python v3 to get anything working:

These are some issues we'll need to resolve:

Changes for zarr-python:

Changes for xarray:

Fixed issues

Things to investigate:

dcherian commented 3 days ago

@TomAugspurger are you able to open a WIP PR with in-progress work. It'd be nice to see what's needed

TomAugspurger commented 3 days ago

Sure, https://github.com/pydata/xarray/pull/9552 has that.

TomAugspurger commented 17 hours ago

Question for the group: does anyone object to xarray continuing to write Zarr V2 datasets by default? I hesitate to have xarray's default be different from zarr-python's, but that would relive some pressure to address https://github.com/pydata/xarray/issues/5475 quickly, since v2 datasets should be round-tripable.