ohnosequences / mg7

Configurable and scalable 16S metagenomics data analysis
https://goo.gl/y3rZFD
GNU Affero General Public License v3.0

Filter out empty reads on BLAST #135

Open laughedelic opened 6 years ago

laughedelic commented 6 years ago

See https://github.com/era7bio/salmon16spb/pull/9#issuecomment-346541632 for context. BLAST fails badly on empty reads, so we shouldn't pass them to BLAST; instead, we should record their IDs in a separate output. Together with #130, we will have 3 types of discarded outputs from BLAST:

This particular filtering could also be done at the split step. I'll think about it.
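
For illustration, here is a rough sketch of the filtering proposed in this issue. It is not mg7 code: it is written in Python rather than the project's language, it assumes the reads arrive as FASTA text, and the names `parse_fasta` and `partition_empty_reads` are hypothetical. It only shows the idea of keeping reads with a non-empty sequence for BLAST and collecting the IDs of the empty ones for a separate output.

```python
from typing import Iterable, Iterator, List, Tuple

Record = Tuple[str, str]  # (read ID, sequence)


def parse_fasta(lines: Iterable[str]) -> Iterator[Record]:
    """Yield (read_id, sequence) pairs from FASTA-formatted lines.

    Assumption: the read ID is the first whitespace-delimited token
    after the leading '>' of a header line.
    """
    read_id = None
    chunks: List[str] = []
    for raw in lines:
        line = raw.strip()
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if read_id is not None:
                yield read_id, "".join(chunks)
            header = line[1:].split()
            read_id = header[0] if header else ""
            chunks = []
        elif read_id is not None:
            chunks.append(line)
    # Emit the last record; its sequence may be empty.
    if read_id is not None:
        yield read_id, "".join(chunks)


def partition_empty_reads(records: Iterable[Record]) -> Tuple[List[Record], List[str]]:
    """Split records into (reads that are safe to pass on, IDs of empty reads)."""
    non_empty: List[Record] = []
    empty_ids: List[str] = []
    for read_id, seq in records:
        if seq:
            non_empty.append((read_id, seq))
        else:
            empty_ids.append(read_id)
    return non_empty, empty_ids


# Hypothetical input: r2 has a blank sequence line, r4 has no sequence at all.
example = [">r1", "ACGT", ">r2", "", ">r3", "GG", "TT", ">r4"]
non_empty, empty_ids = partition_empty_reads(parse_fasta(example))
```

In this sketch `non_empty` is what would go on to BLAST and `empty_ids` is what would be written to the separate discarded output. The same partition could be applied at the split step instead.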