Open Rridley7 opened 1 year ago
Hi,
Sure, it is just based on the NM tag plus the length of the read, etc. The CIGAR string cannot be reliably used for %ID because mapping software often does not distinguish between a match and a mismatch: both are encoded as 'M'. ben
I see. So if I'm understanding correctly, % identity is calculated as (length of aligned region - NM) / (length of aligned region)? Is the length of the aligned region taken from the CIGAR string, from the start and end reference positions, or from something else?
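To make the formula in this question concrete, here is a minimal Python sketch. It is not the tool's actual implementation. It assumes the aligned length is the number of alignment columns in the CIGAR string (M, =, X, I and D operations), which is exactly the detail being asked about, and the function names are made up for illustration.

```python
import re

# One CIGAR operation: a count followed by an operation letter.
CIGAR_OP = re.compile(r"(\d+)([MIDNSHP=X])")

def aligned_columns(cigar: str) -> int:
    """Count alignment columns, assumed here to be M, =, X, I and D."""
    return sum(int(n) for n, op in CIGAR_OP.findall(cigar) if op in "M=XID")

def percent_identity(cigar: str, nm: int) -> float:
    """(aligned length - NM) / aligned length, as a percentage.

    NM is the edit distance from the SAM NM tag (mismatches plus
    inserted and deleted bases). 'M' alone cannot give this number
    because it covers both matches and mismatches.
    """
    length = aligned_columns(cigar)
    if length == 0:
        raise ValueError("no aligned columns in CIGAR")
    return 100.0 * (length - nm) / length
```

For example, a read with CIGAR `100M` and `NM:i:5` would score 95% under this definition, even though the CIGAR string on its own shows no mismatches.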
Following this, is min-read-aligned-length the difference (this alignment length - original read length) / original read length?
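One possible reading of an aligned-length percentage is the fraction of the read's bases that fall inside the alignment. The sketch below shows that reading only. It is an assumption and is not confirmed anywhere in this thread as the tool's definition. The function name is hypothetical, and the read length is reconstructed from the CIGAR string, including soft- and hard-clipped bases.

```python
import re

# One CIGAR operation: a count followed by an operation letter.
CIGAR_OP = re.compile(r"(\d+)([MIDNSHP=X])")

def read_aligned_fraction(cigar: str) -> float:
    """Fraction of read bases inside the alignment (assumed definition)."""
    ops = [(int(n), op) for n, op in CIGAR_OP.findall(cigar)]
    # Read bases that are part of the alignment: M, =, X and insertions.
    aligned = sum(n for n, op in ops if op in "M=XI")
    # Original read length: aligned bases plus soft/hard clipped bases.
    read_len = sum(n for n, op in ops if op in "M=XISH")
    if read_len == 0:
        raise ValueError("CIGAR contains no read bases")
    return aligned / read_len
```

Under this assumption, a 100 bp read with CIGAR `10S80M10S` has an aligned fraction of 0.8.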
Hello, I wanted to ask what methods and calculations are being used to calculate percent identity and alignment length percentage when filtering? Is this related to the NM tag, or parsing of the cigar string? Thanks!