Closed MatthewRalston closed 8 years ago
From the usage of CleanSam: "Cleans the provided SAM/BAM, soft-clipping beyond-end-of-reference alignments and setting MAPQ to 0 for unmapped reads". Looking at the code, that is exactly what it does, no more, no less.
So, no CleanSam does not remove duplicate alignments. if you don't want duplicate alignements in your bam, it might be easier to tell TopHat to only emit one alignment... (I think -g 1 is the flag you want.)
Is it correct to assume that "Clean" sam files should be usable by other Picard tools? It would be useful if CleanSam removed duplicate alignments from BAM files. Tophat alignments with duplicate alignment records that are then cleaned with CleanSam do not play nicely with MarkDuplicates.
Tophat v2.1.1 Picard v2.2.2