Open timodonnell opened 9 years ago
I'm also thinking about the best way to filter variants by expression level (of either genes, transcripts, or allele-specific read count). Do you think these two use cases have enough in common to suggest a filtering API?
Maybe just a VariantCollection.filter function that takes a variant -> bool callable and returns a new VariantCollection?
Then we could add a new module with filter implementations, including both your example and mine.
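To make the proposal concrete, here is a minimal sketch of what such a filter method might look like. Everything in it is an assumption for illustration: the `Variant` tuple and the `VariantCollection` class are toy stand-ins rather than the project's real classes, and `min_gene_expression`, the gene names, and the expression values are hypothetical.

```python
from collections import namedtuple

# Hypothetical stand-in for a variant record; not the project's real class.
Variant = namedtuple("Variant", ["contig", "start", "ref", "alt", "gene"])


class VariantCollection:
    """Toy collection used only to illustrate the proposed filter method."""

    def __init__(self, variants):
        self.variants = list(variants)

    def __len__(self):
        return len(self.variants)

    def filter(self, predicate):
        """Return a new collection of the variants for which predicate(variant) is true."""
        return VariantCollection(v for v in self.variants if predicate(v))


def min_gene_expression(expression_by_gene, threshold):
    """Build a predicate that keeps variants whose gene meets the threshold."""
    def predicate(variant):
        # Genes missing from the table are treated as not expressed (an assumption).
        return expression_by_gene.get(variant.gene, 0.0) >= threshold
    return predicate


variants = VariantCollection([
    Variant("1", 100, "A", "T", "GENE_A"),
    Variant("1", 250, "G", "C", "GENE_B"),
    Variant("2", 75, "C", "G", "GENE_C"),
])
expression = {"GENE_A": 12.5, "GENE_B": 0.3}  # made-up values
expressed = variants.filter(min_gene_expression(expression, threshold=1.0))
```

Because the filter only sees a callable, an expression filter and a capture-target filter could live in the same module as plain predicate factories, and the original collection is left untouched.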
For some of our analyses it would be helpful to be able to filter a VariantCollection to only those variants that fall within the intended capture targets.
Example intervals list:
GATK also supports a few other formats (probably not needed here though): https://www.broadinstitute.org/gatk/guide/article?id=1319
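A capture-target filter could be written as a predicate over variants, roughly as sketched below. This assumes intervals given as `contig:start-end` strings with 1-based inclusive coordinates; the `Variant` tuple, the helper names, and the interval values are all hypothetical and are not taken from the example list above or from any real library.

```python
from collections import namedtuple

# Hypothetical stand-in for a variant record; not the project's real class.
Variant = namedtuple("Variant", ["contig", "start", "ref", "alt"])


def parse_interval(text):
    """Parse 'contig:start-end' (or 'contig:pos') into (contig, start, end)."""
    contig, _, span = text.strip().rpartition(":")
    start, _, end = span.partition("-")
    # A single position such as '1:150' is treated as a one-base interval.
    return contig, int(start), int(end or start)


def within_targets(interval_strings):
    """Build a predicate that is true when a variant starts inside any target."""
    targets = [parse_interval(t) for t in interval_strings]

    def predicate(variant):
        # Linear scan over targets; fine for a sketch, slow for large target lists.
        return any(
            contig == variant.contig and start <= variant.start <= end
            for contig, start, end in targets
        )
    return predicate


# Made-up intervals and variants, for illustration only.
keep = within_targets(["1:100-200", "2:1000-1100"])
variants = [
    Variant("1", 150, "A", "T"),
    Variant("1", 500, "G", "C"),
    Variant("2", 1000, "C", "G"),
]
on_target = [v for v in variants if keep(v)]
```

For real capture kits with many thousands of targets, a sorted-interval or interval-tree lookup would likely be a better fit than the linear scan used here.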