Ensembl / ensembl-vep

The Ensembl Variant Effect Predictor predicts the functional effects of genomic variants
https://www.ensembl.org/vep
Apache License 2.0
445 stars 151 forks source link

Use of uninitialized value in subtraction (-) #886

Closed LenkaSeg closed 3 years ago

LenkaSeg commented 3 years ago

Describe the issue

When running VEP this message appears:

Use of uninitialized value in subtraction (-) at /opt/vep/src/ensembl-vep/Bio/EnsEMBL/TranscriptMapper.pm line 384, <__ANONIO__> line 1.

The output is fine.

Additional information

Happens only for the hgvs form of input: example: AKT1:Asp46Glu

There was some remotely similar issue here #636 I tried everything mentioned there, including the upgrading the tabix, but it didn't help.

System

Full VEP command line

vep -i input.tsv --force_overwrite -o STDOUT --tab --fields "Uploaded_variation,Location,Gene,Consequence,SYMBOL,ENSP,CANONICAL,Amino_acids,Protein_position,Feature,EXON,INTRON,Codons" --symbol --protein --numbers --canonical --no_stats --cache --fasta [path/to/Homo_sapiens.GRCh38.dna.toplevel.fa.gz] --dir_cache [path/to/cache]

Full error message

Use of uninitialized value in subtraction (-) at /opt/vep/src/ensembl-vep/Bio/EnsEMBL/TranscriptMapper.pm line 384, <__ANONIO__> line 1.
Use of uninitialized value in subtraction (-) at /opt/vep/src/ensembl-vep/Bio/EnsEMBL/TranscriptMapper.pm line 385, <__ANONIO__> line 1.

Data files (if applicable)

They include:

aparton commented 3 years ago

Hi @LenkaSeg,

Thank you for this report. Unfortunately, we do not support HGVS in this format as it is missing a variant descriptor (i.e. :p., :g. or :c.). Additionally, we do not recommend using gene symbols as the reference sequence, as this gives ambiguity for which transcript or translation is intended.

If you use the associated Ensembl protein identifier instead, and add the variant descriptor (e.g. ENSP00000270202.4:p.Asp46His), then VEP should annotate your variants correctly. As an aside, VEP will only be able to provide annotations with HGVSp input if there is an unambiguous way to change from reference to alternate amino acid at the genomic level.

Kind Regards, Andrew

LenkaSeg commented 3 years ago

Oh, sorry, I made a mistake, the variant descriptor was actually part of the input. example: AKT1:p.Asp46Glu

Thank you! I will use the protein identifier. I had noticed that no example using gene symbol was given in the input documentation, but it seemed to work fine and without delay in the web interface, so I tried to use it in the command line too.

Would you like me to make a pull request for the input documentation with this example and your explanation? Maybe it would help others too. Another thing that might be useful is to present a warning in stderr when the --verbose flag is enabled.

Thank you very much for your reply and help!

aparton commented 3 years ago

Hi,

I'm going to close this issue now. If you have any further questions, please feel free to reopen it or create a new issue.

Kind Regards, Andrew