Open alimanfoo opened 5 years ago
On the genomics side, I think I will do a conversion of the human 1000 genomes phase 3 variation dataset from VCF to zarr. I will probably then upload the data to Google cloud object storage, and may also make it available for download via a public FTP site.
I may also do something similar for the Anopheles gambiae 1000 genomes phase 2 variation dataset.
The imaging format discussed recently is of course a work-in-progress and so likely out of scope, but there are definitely publicly available imaging datasets that could be converted if interest arises.
cc: @ambrosejcarr
Within the paper it would be good if we could point to a number of datasets that have been converted to zarr and made publicly available, to serve as exemplars and for anyone interested in the paper to access and try out.