broadinstitute / viral-ngs

Viral genomics analysis pipelines
Other
190 stars 67 forks source link

from human depletion databases, mask any sequence similar to viral or bacterial #805

Open notestaff opened 6 years ago

notestaff commented 6 years ago

Reduce the chance of erroneously depleting viral or bacterial sequences that have remote homology to human.

tomkinsc commented 6 years ago

Brian Bushnell at JGI has a tool for masking by similarity and also low entropy regions, BBMask, could be worth a look: https://jgi.doe.gov/data-and-tools/bbtools/bb-tools-user-guide/bbmask-guide/ (background thread:) http://seqanswers.com/forums/showthread.php?t=42552