Closed ernfrid closed 9 years ago
Yes, it is currently the case that only one "clip" opcode is expected at the beginning or end of a CIGAR string. If there are more than one at the beginning, all but the first will not be included in the start clip, and the last clip in the entire CIGAR string will be treated as the end clip.
Thanks for pointing this out. This is clearly a bug, and I will fix it as soon as I can. However, I am just curious as to what aligner or other tool produced this SAM file.
Thanks, Greg
I faked this data to illustrate the bug. I expect it is not a common occurrence.
Fixed in release 0.1.22.
It's possible to have multiple hard clip and soft clip operations at the beginning of a read. Currently, I believe samblaster only parses the first of these as a "sclip" event and any subsequent as an end clip.
Here's a SAM example of the behavior. I believe 3/4 reads should be marked as duplicates but only 2 of 4 are.