related-sciences / gwas-analysis

GWAS data analysis experiments
Apache License 2.0
24 stars 6 forks source link

Explore Xarray as the basis for a genetic toolkit API #5

Closed eric-czech closed 4 years ago

eric-czech commented 4 years ago

I'm not sure what advantages having labeled axes has for big call matrices, but xarray may make sense as a way to carry along the variant and sample metadata (as opposed to dask dfs). It will be worth a shot to see how useful that interface is.

eric-czech commented 4 years ago

Notable Xarray limitations found so far are:

Note: some of these were aggregated from https://github.com/related-sciences/rs-platform/issues/19#issuecomment-594211481

eric-czech commented 4 years ago

There isn't much remaining doubt about Xarray. Updates on some of the limitations above:

The others are still legitimate limitations but don't affect core GWAS operations.