iqbal-lab-org / pling

Plasmid analysis using rearrangement distances
MIT License
28 stars 1 forks source link

Integerisation from alignment misses duplicates in overlapping matches #9

Closed babayagaofficial closed 9 months ago

babayagaofficial commented 1 year ago

Screenshot from 2023-08-08 17-58-15

in the above example there is a large overlap between two matches that intuitively ought to break up the two matches into three, and produce integerisation:

ref 1 2 3 4 5 6 7 query 7 1 -3 13 -2 4 5 6 7

furthermore we see that in the integerisation from annotation that there is indeed a duplication however we get:

ref 1 2 3 4 5 6 query 1 -3 13 -2 4 5 6

ought to try catch large overlaps and treat overlap as a separate block