zarr-developers / zarr-paper

Repository for developing an initial article describing Zarr for peer-reviewed publication
Other
1 stars 1 forks source link

Public datasets #2

Open alimanfoo opened 5 years ago

alimanfoo commented 5 years ago

Within the paper it would be good if we could point to a number of datasets that have been converted to zarr and made publicly available, to serve as exemplars and for anyone interested in the paper to access and try out.

alimanfoo commented 5 years ago

On the genomics side, I think I will do a conversion of the human 1000 genomes phase 3 variation dataset from VCF to zarr. I will probably then upload the data to Google cloud object storage, and may also make it available for download via a public FTP site.

I may also do something similar for the Anopheles gambiae 1000 genomes phase 2 variation dataset.

joshmoore commented 5 years ago

The imaging format discussed recently is of course a work-in-progress and so likely out of scope, but there are definitely publicly available imaging datasets that could be converted if interest arises.

cc: @ambrosejcarr