schneebergerlab / syri

Synteny and Rearrangement Identifier
https://schneebergerlab.github.io/syri/
MIT License
323 stars 35 forks source link

Is translocation one kind of duplication variation? #147

Closed sen1019san closed 2 years ago

sen1019san commented 2 years ago

Hi Manish, I find translocation regions are not 1:1 in the same chromosome between query and ref genome comparison. Here is an example that SYNAL is overlapping with TRANSAL:

image

I try to explain this variation example: image

It means that SYNAL24765 overlapping with TRANSAL38969, SYNAL14400 overlaps with TRANSAL38971, and SYNAL14401 overlaps with TRANSAL38972.

I wonder whether the translocation is 1:1 or 1:much between query and ref genome.

Thanks!

mnshgl0110 commented 2 years ago

Syri allows some overlap between translocations and other variants. This is overlap is estimated at the translocation block level and not for individual alignments. For example, syri would call the blue region as translocated even though there is some overlap with syntenic (grey) regions image

When a translocation consists of multiple alignments, then it is possible that some alignments overlap with syntenic regions like below. image

I think this is the case that you have noticed. If you check the endpoints of TRANS22334 block, then there should be regions that are not overlapping with any syntenic/inverted region effectively making it a translocation. You can regulate the allowed overlap by changing the --unic and --unip parameters.

sen1019san commented 2 years ago

Thanks a lot, Manish. I am confused with the syri output for translocation and duplication definition. May I naively think that translocation is 1:1 between the reference genome and query genome.