fritzsedlazeck / SURVIVOR

Toolset for SV simulation, comparison and filtering
MIT License
347 stars 46 forks source link

fail to simulate nested SVs like INV_del or INV_dup #138

Closed Jingwen7 closed 3 years ago

Jingwen7 commented 3 years ago

Hi @fritzsedlazeck,

Thanks for developing SURVIVOR! I wanted to simulate nested SVs. I specify the number of INV_del and INV_dup I want to simulate in the parameter_file. But there is no INV_del and INV_dup in the resulting simulated.bed and simulated.vcf. Do you have an idea what could be causing this? Thank you! my parameter_file looks like this: PARAMETER FILE: DO JUST MODIFY THE VALUES AND KEEP THE SPACES! DUPLICATION_minimum_length: 100 DUPLICATION_maximum_length: 10000 DUPLICATION_number: 100 INDEL_minimum_length: 20 INDEL_maximum_length: 10000 INDEL_number: 100 TRANSLOCATION_minimum_length: 1000 TRANSLOCATION_maximum_length: 5000 TRANSLOCATION_number: 100 INVERSION_minimum_length: 600 INVERSION_maximum_length: 2000 INVERSION_number: 100 INV_del_minimum_length: 600 INV_del_maximum_length: 1000 INV_del_number: 50 INV_dup_minimum_length: 600 INV_dup_maximum_length: 1000 INV_dup_number: 50

Best, Jingwen

Jingwen7 commented 3 years ago

And the reference I'm using is chr20, chr21 and chr22.

fritzsedlazeck commented 3 years ago

@Jingwen7 thanks for posting this. Can you check (I would need to run it myself again since its been a while) if there are INV with adjacent DEL events? Back then I remember that I reported these events seperately. Thanks Fritz

Jingwen7 commented 3 years ago

@fritzsedlazeck Thanks for the quick reply! I checked the simulated.vcf. Just to comfirm, the first image shows a INVDUP, right? Because there is a INV and DUP share exact same coordinates. And the second image shows a INVDEL, since a INV and DEL are right beside each other. image

image
fritzsedlazeck commented 3 years ago

yes right. that's how I implemented it . I wanted to use the existing types and not invent something new. Cheers Fritz

Jingwen7 commented 3 years ago

Thanks for the reply! I also want to ask a followup question about ngmlr and sniffles. After simulating SVs and simulating reads from the altered reference, I get alignment from several aligners (including ngmlr). And I used sniffles to call SVs from alignment. But the resulting vcf file from sniffles only contains less than 10 SVs (for each aligner), which is way below what I've simulated. Do you have any idea what is causing this?

Best, Jingwen

fritzsedlazeck commented 3 years ago

Maybe check in IGV to see if you can look at the SV and one other common thing would be the simulated coverage vs. the -s parameter from Sniffles that is set.

Jingwen7 commented 3 years ago

Yes! The default -s parameter is 10, which is relatively large comparing to my simulated coverage (10x). It works now! Thank you very much! :)

fritzsedlazeck commented 3 years ago

great. I will close this issue. thanks Fritz