malariagen / ag1000g-phase3-data-paper

Other
1 stars 2 forks source link

q6 Add pipeline to create allele count arrays #31

Closed hardingnj closed 3 years ago

hardingnj commented 3 years ago

pipeline will generate 3 separate zip zarrs for 3 species groups with regions and selection of samples handled.

Working towards https://quire.io/w/malariagen-ag3.0-paper/6/PCA_analysis_-_first_pass?filter=all

hardingnj commented 3 years ago

Tagged @alimanfoo as a reviewer, just a bit unsure on using dask non-interactively. Specifically whether try/finally is appropriate.

I don't want to end up billing $$$.

cclarkson commented 3 years ago

Looks cool to me but it will be best if @alimanfoo checks the dask.

cclarkson commented 3 years ago

@hardingnj - what's the plan with running this pipeline? I'll be needing the output for UMAP at some point.

hardingnj commented 3 years ago

superseded by #34 (by @cclarkson), but @alimanfoo comments still relevant here.