Open baraaorabi opened 1 year ago
P.S.: This might be related to #502?
Yes, related but not the same. We may move 437I
around in this case, though there will still be a CIGAR like xxxIyyyN
, which is hard to resolve.
Can it not be resolved by linear time postprocessing? Something like anochoring the (mis)matches in the CIGAR string, and then sorting-and-merging any other CIGAR operations between (mis)match anchors?
No, can't.
I found that Minimap2 sometimes generates an unusual order of CIGAR opertations; with some
N
s, followed by someI
s then followed by moreN
s instead of the more expected order of having all theN
s combined together.Here is an example that produces this result:
You can see that after the first exon,
101S61M2I99M2D1M1I9M1D24M
, there are183N
skip, followed by437I
insertion, followed by another skip of131864N
.