Closed jperales closed 4 weeks ago
Just to let you know that this finding is also present in VEP 112 . Thanks!
Javier
Hi Javier,
Hope you are doing well.
The VCF specification (section 1.6.1) denotes the following:
- REF — reference base(s): (...) If any of the ALT alleles is a symbolic allele (an angle-bracketed ID String
<ID>
) then the padding base is required and POS denotes the coordinate of the base preceding the polymorphism.
So, for a variant with a symbolic allele in VCF format, the position reflects the preceding base of the structural variant. As such, you need to increment 1 to get its start position.
Hope this was clear, but tell me if otherwise.
Cheers, Nuno
Hi Nuno, You made it very clear. Apologies I did not know about this VCF specification in the POS for structural variants. Then it makes sense to increment 1 the start position as it is already implemented. I close this issue. Thank you very much!
Best, Javier
I find confusing that
VEP custom
outputs the coordinates of the annotation derived from a Structural Variant (SV) by adding +1 to the start position despite the coordinates being in a 1-based coordinate system already. It reminds me of a conversion from 0-based to 1-based coordinate system. However the--custom file
is a VCF, which is 1-based already. So it would not make sense to make such conversion. Could you please tell me more about the coordinate system of the annotation derived from Structural Variants?Moreover, I noticed that it does not happen with annotations derived from SNVs, it does only with SVs. If that is a bug, please feel free to add a label to it. Thank you!
I include toy examples below for both SNV and SV cases below with the following summary table:
chr1 113834946 . A G . . .
chr1 113834946 rs2476601 A G . . .
chr1:113834946-113834946
chr19 418826 . N <DEL> . . END=467860
chr19 418826 variant_is_80_245973__DEL N <DEL> . PASS END=467860
chr19:418827-467860
Test case: SNV
Input variant (VCF format)
Custom file (VCF format), showing SNVs at the position.
Command line and output --> Annotation coordinates:
"name":"chr1:113834946-113834946"
Test case: SV
Input variant (VCF format):
Annotation variant
Command line and output --> Annotation coordinates:
"name":"chr19:418827-467860"
System
Data files
They include: