KangchengHou / admix-kit

Toolkit for analyzing genetics data from admixed populations
https://kangchenghou.github.io/admix-kit
22 stars 5 forks source link

on-disk memory efficient parser for vcf to zarr #2

Closed KangchengHou closed 2 years ago

KangchengHou commented 2 years ago

see https://github.com/pystatgen/sgkit/blob/d3b77ed878ce421c9363b948a9796f46572fa44a/sgkit/io/vcf/vcf_reader.py#L358 or vcf_to_zarr in scikit-allel

We may first apply that function, format ourselves as a quick fix.