Our DNA alignment pipeline currently leaves both mapped and unmapped reads in the BAM. This makes it easier to later inspect the unmapped reads (e.g. were they viral?) and also lets us estimate the mapping rate directly from sambamba flagstat output.
STAR, on the other hand, drops unmapped reads from the output BAM by default.
To retain unmapped reads, this PR adds --outSAMunmapped Within to invocations of STAR.
Here's the STAR reference manual entry for this option:
--outSAMunmapped
default: None
string: output of unmapped reads in the SAM format
None
no output
Within
output unmapped reads within the main SAM file (i.e. Aligned.out.sam)
Our DNA alignment pipeline currently leaves both mapped and unmapped reads in the BAM. This makes it easier to later inspect the unmapped reads (e.g. were they viral?) and also lets us estimate the mapping rate directly from
sambamba flagstat
output.STAR, on the other hand, drops unmapped reads from the output BAM by default.
To retain unmapped reads, this PR adds
--outSAMunmapped Within
to invocations of STAR.Here's the STAR reference manual entry for this option: