getzlab / rnaseqc

Fast, efficient RNA-Seq metrics for quality control and process optimization
Other
146 stars 19 forks source link

Handling of supplementary alignments #45

Open porchard opened 4 years ago

porchard commented 4 years ago

Hi,

Thanks for the tool! I recently ran this on a large cohort and noticed some oddities in the output (e.g., end 1 mapping rates > 1 for a subset of the libraries). The issue seems to be attributable to a large number of supplementary alignments. As far as I can tell RNA-SeQC doesn't check the 0x800 flag denoting supplementary alignments and this throws off some of the numbers -- for example, inflating the 'unique mapping, vendor QC passed reads'.

Given the set of metrics that RNA-SeQC calculates, I would have expected it to exclude supplementary alignments from many of the metrics as it does for secondary alignments. Is there any particular reason for ignoring the supplementary alignment flag?

Happy to put together a pull request to treat supplementary alignments in a similar manner as secondary alignments, if you determine that to be desirable.

agraubert commented 4 years ago

Thank you for bringing this to our attention! We apologize for not catching this issue earlier. We're currently working on adding in an appropriate filter for supplementary reads and will keep you updated

porchard commented 4 years ago

That sounds great -- I'll look forward to the updated version, then!

agraubert commented 4 years ago

Just wanted to give you an update. We've updated the way RNA-SeQC counts chimeric reads and how it handles supplementary alignments. We're testing the new changes and expect to finalize the new changes soon

porchard commented 4 years ago

Thanks! I'll give it a go once the changes are pushed.