i.e. if there are no reads at all aligned to these sequences, ignore them later on in the filter.
I'm not sure how much time this would save, but there are a surprisingly large amount of these contigs in the SheepGut dataset (16,659 out of 78,793 contigs, by my count), so a small optimization might actually be nice. Not sure how long calling bf.fetch(seq) on an "empty" seq takes.
i.e. if there are no reads at all aligned to these sequences, ignore them later on in the filter.
I'm not sure how much time this would save, but there are a surprisingly large amount of these contigs in the SheepGut dataset (16,659 out of 78,793 contigs, by my count), so a small optimization might actually be nice. Not sure how long calling
bf.fetch(seq)
on an "empty"seq
takes.