write_empty_chunks and zarr.zeros

zarr-developers / zarr-python

An implementation of chunked, compressed, N-dimensional arrays for Python.

MIT License


hi @AlexHenderson, try passing write_empty_chunks=False as a keyword argument to zarr.save.

Your code example actually creates two separate zarr arrays. The first is created by zarr.zeros and uses in-memory storage, which is the default storage backend when no store is specified. zarr.save then creates a second array on the file system using one of the file system-based storage backends. That second array is a structurally identical copy of the first, but write_empty_chunks is a runtime setting, not part of the array metadata, so it is not copied over automatically when the second array is created.

You can skip the call to zarr.save entirely by passing a store keyword argument to zarr.zeros, e.g.

arr = zarr.zeros((10000, 10000), store='testlocation', path='my_array', chunks=(1000, 1000), dtype='i4', write_empty_chunks=False)


write_empty_chunks and zarr.zeros #2060