Open tseemann opened 4 years ago
Yes, the region from 104092 to 104221 is duplicated since this region is mapped with multi-mapping reads. MapCaller identifies CNVs with multi-mapping reads. MapCaller does not predict the copy numbers since they are difficult to estimate.
So the reads aligning to this region in the reference genome ALSO align to another region in the reference genome.
Where is that other region?
Can you add tags to links all the duplicate regions?
Maybe put DUP00001
in the ID
column for all the same DUP regions?
It is a good suggestion. I'll try to implement this feature.
How do i interpret this line?
It was introduced only in the ChangeLog as:
Does it mean the region from 104092 (which starts with a
T
) to 104221 (which is 109 bp long) is duplicated exactly somewhere else in the genome? If so, where is it duplicated? What if it is triplicated? how do we know whatN
you predicted for it? Does this have anything to do with multi-mapping reads?