apeltzer / DeDup

A merged read deduplication tool capable of performing merged read deduplication on single-end data.
GNU General Public License v3.0

DeDup log file incorrectly reports deduplication rate #10

Open jfy133 opened 4 years ago

jfy133 commented 4 years ago

DeDup currently calculates the deduplication rate as the number of reads after deduplication divided by the total number of reads in the BAM file. This is incorrect: if the BAM file includes unmapped reads, the denominator becomes very large and the reported rate is vastly reduced.

The denominator should be the number of mapped reads prior to deduplication, as a deduplication rate can only be calculated for reads to which deduplication can be applied.
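To illustrate the difference, here is a minimal Python sketch, not DeDup's actual code. It assumes each read is represented only by its SAM FLAG value, where bit `0x4` marks an unmapped read; the helper names (`count_mapped`, `rate`) and the example counts are made up for illustration.

```python
UNMAPPED_FLAG = 0x4  # SAM FLAG bit: read is unmapped


def count_mapped(flags):
    """Count reads whose SAM FLAG does not have the unmapped bit set."""
    return sum(1 for flag in flags if not flag & UNMAPPED_FLAG)


def rate(numerator, denominator):
    """Return numerator / denominator, or 0.0 for an empty denominator."""
    return numerator / denominator if denominator else 0.0


# Hypothetical input: 4 mapped reads (flags 0 and 16) and 96 unmapped reads (flag 4).
flags = [0, 16, 0, 16] + [4] * 96

# Hypothetical numerator: the read count the log currently divides by total reads.
numerator = 2

# Current behaviour as described above: denominator is every read in the file.
rate_total = rate(numerator, len(flags))

# Proposed behaviour: denominator is mapped reads only.
rate_mapped = rate(numerator, count_mapped(flags))

print(f"total-read denominator:  {rate_total:.2f}")
print(f"mapped-read denominator: {rate_mapped:.2f}")
```

With mostly unmapped input, the same numerator yields 0.02 against all reads but 0.50 against mapped reads, which is the distortion described above.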