KolmogorovLab / Severus

A tool for somatic structural variant calling using long reads
Other
105 stars 5 forks source link

INFO/END is smaller than POS #11

Closed linsindian closed 1 month ago

linsindian commented 8 months ago

Hi @aysegokce ,

I'm currently trying to replicate your testing on COLO829 using the COLO829 nanopore data provided by epi2me labs. The workflow likes: Clair3 -> whatshap phasing and tagging -> severus call SV -> truvari bench.

However, I encountered an error while attempting to index severus_somatic.vcf using bcftools: [W::tbx_parse1] VCF INFO/END=10003 is smaller than POS at chr1:180575. It seems that this issue stems from the possibility of encountering END being smaller than POS when SVtype is BND.

Could you please explain how you operate Truvari as a benchmarking tool?

Best, Sin-Dian

aysegokce commented 8 months ago

Hello @linsindian,

Truvari only works with sequence-resolved SVs like INDELs and cannot handle BNDs. Since SVs in the cancer genome are BNDs in major, we couldn't find a tool that can handle different representations. We developed Minda, optimized for cancer data and agnostic to SV types, to merge and compare VCFs. It will be available soon.

In the vcf format, BND calls need to be duplicated as a-to-b and b-to-a, and in the b-to-a row, END will be smaller than the POS.

Best Ayse

TBradley27 commented 5 months ago

Hi @aysegokce,

I was just wondering whether what you wrote is also applicable to SVs of type INV?

When using snpeff we saw a very similar error for an INV SV as to what is reported above