populationgenomics / saige-tenk10k

Hail batch pipeline to run SAIGE on CPG's GCP
MIT License
0 stars 0 forks source link

always subset by locus first when doing locus and sample #162

Closed Alexander-Stuckey closed 6 days ago

Alexander-Stuckey commented 6 days ago

As noted in https://discuss.hail.is/t/filtering-samples-from-vds-in-google-cloud/3718/6, it's always faster to subset by locus / interval first, if possible. So just updating the code to do that.