Closed rklocke closed 1 year ago
Hi @rklocke,
I cannot reproduce the issue.
Could you please check if the file revel_b37.tsv.gz has the location 1:66058513?
Hi @dglemos,
The file revel_b37.tsv.gz has this location; the second entry is the REVEL score for the relevant variant:
docker run -v /home/dnanexus:/data be9eb9a28830 zgrep 66058513 revel_b37.tsv.gz
1 66058513 65592830 A C Q P 0.153 ENST00000344610;ENST00000349533;ENST00000371060;ENST00000371059;ENST00000371058
1 66058513 65592830 A G Q R 0.034 ENST00000344610;ENST00000349533;ENST00000371060;ENST00000371059;ENST00000371058
1 66058513 65592830 A T Q L 0.125 ENST00000344610;ENST00000349533;ENST00000371060;ENST00000371059;ENST00000371058
Here are the files I have mounted (the latest CADD.pm and REVEL.pm files are in /Plugins):
$ pwd
/home/dnanexus
$ ls
InDels_GRCh37.tsv.gz InDels_GRCh37.tsv.gz.tbi
Plugins plugin_config.txt
gnomad.genomes.r2.1.1.indel.tsv.gz revel_b37.tsv.gz
gnomad.genomes.r2.1.1.indel.tsv.gz.tbi revel_b37.tsv.gz.tbi
X221099_subset.vcf homo_sapiens_refseq
cadd_whole_genome_SNVs_GRCh37.tar.gz homo_sapiens_refseq_vep_109_GRCh37.tar.gz
cadd_whole_genome_SNVs_GRCh37.tar.gz.tbi hs37d5.fasta-index.tar.gz vep_v109.3.tar.gz
I have attached the input VCF used (4 missense variants) and the output VCF. CADD_PHRED is annotated as expected but REVEL is not. My command:
$ docker run -v /home/dnanexus:/data be9eb9a28830 vep \
-i /data/X221099_subset.vcf -o /data/X221099_annotated.vcf \
--vcf --cache --refseq --numbers --format vcf --offline --exclude_null_alleles --assembly GRCh37 \
--plugin REVEL,/data/revel_b37.tsv.gz \
--plugin CADD,/data/cadd_whole_genome_SNVs_GRCh37.tar.gz,/data/gnomad.genomes.r2.1.1.indel.tsv.gz,/data/InDels_GRCh37.tsv.gz \
--fields Allele,SYMBOL,Consequence,IMPACT,EXON,INTRON,HGVSc,HGVSp,REVEL,CADD_PHRED
>> Unknown option: no_update
Ignoring unsupported option 'no_update' found via ENV variable or INI file
Unknown option: no_htslib
Ignoring unsupported option 'no_htslib' found via ENV variable or INI file
Unknown option: pluginsdir
Ignoring unsupported option 'pluginsdir' found via ENV variable or INI file
Unknown option: no_plugins
Thanks in advance for your help! Becky
Apologies, the annotated VCF from the exact command I had given is attached (the previous VCF was generated from a command with --force_overwrite and not including --offline). It seems strange that the only --plugin information that remains in the ##vep-command-line description in the annotated VCF is:
--plugin [PATH]/InDels_GRCh37.tsv.gz
and that no errors are given about not being able to find the relevant files. Is there a different way I should specify plugins in v109?
Thanks again for any help :) X221099_annotated_2.vcf.gz
Thank you for sending all these details, now I can reproduce the issue.
The REVEL plugin is trying to match the transcript ids from the file revel_b37.tsv.gz with the transcripts from the cache. The problem is that revel_b37.tsv.gz only has Ensembl ids while the cache you are using has RefSeq ids:
revel_b37.tsv.gz: ENST00000344610;ENST00000349533;ENST00000371060;ENST00000371059;ENST00000371058
cache: NM_002303.6
We will update the plugin to optionally not match by transcript id. In the meantime, you could remove the last column from the revel file.
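To illustrate the mismatch described above, here is a minimal Python sketch. It is not the plugin's actual Perl code; the function name `matches_cache_transcript` is hypothetical, and stripping the version suffix before comparing is an assumption made for the illustration. It only shows why a lookup keyed on transcript id finds nothing when the REVEL file lists Ensembl ids and the cache supplies RefSeq ids.

```python
# Transcript id column copied from the revel_b37.tsv.gz rows shown earlier in this thread.
REVEL_TRANSCRIPTS = (
    "ENST00000344610;ENST00000349533;ENST00000371060;"
    "ENST00000371059;ENST00000371058"
)


def matches_cache_transcript(revel_transcripts: str, cache_transcript_id: str) -> bool:
    """Return True if the cache transcript id appears in the REVEL id list.

    Hypothetical helper: the version suffix (e.g. ".6") is dropped before
    comparing, which is an assumption of this sketch.
    """
    revel_ids = set(revel_transcripts.split(";"))
    unversioned = cache_transcript_id.split(".")[0]
    return unversioned in revel_ids


# RefSeq id from the --refseq cache: never present in an Ensembl-only list.
refseq_hit = matches_cache_transcript(REVEL_TRANSCRIPTS, "NM_002303.6")
# Ensembl id: present, so a match would be found with an Ensembl cache.
ensembl_hit = matches_cache_transcript(REVEL_TRANSCRIPTS, "ENST00000344610")
```

With the values from this thread, `refseq_hit` is `False` and `ensembl_hit` is `True`, which is consistent with the REVEL field staying empty when the RefSeq cache is used.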
Hi Diana, thank you for looking into this and for explaining the cause. Just checking if there is an estimate of when the plugin might be updated / the next VEP release? Thanks!
We plan to update the plugin in the next release which is scheduled to happen in the summer.
Hi, I've modified our revel_b37.tsv.gz file to remove the last column like below:
#chr hg19_pos grch38_pos ref alt aaref aaalt REVEL
1 35142 35142 G A T M 0.027
1 35142 35142 G C T R 0.035
1 35142 35142 G T T K 0.043
I then ran the VEP v109.3 Docker image, and REVEL is now annotated mostly as expected. However, I noticed that one of our variants now has the wrong REVEL score:
2 73717567 rs2056486 G T 3457.77 . AC=1;AF=0.5;AN=2;BaseQRankSum=-4.604;ClippingRankSum=0;DB;DP=261;ExcessHet=3.0103;FS=1.492;MLEAC=1;MLEAF=0.5;MQ=60;MQRankSum=0;QD=13.35;ReadPosRankSum=1.443;SOR=0.6;CSQ=T|ALMS1||SNV|missense_variant|MODERATE|10/23||NM_015120.4|NM_015120.4:c.8484G>T|NP_055935.4:p.Arg2828Ser||rs2056486|1|383761|Benign||Alstrom_syndrome&not_specified&Cardiovascular_phenotype|11646|30918|0.376674|64709|245736|0.263327|11776|0.252|58|194|1000|||||0.00|0.00|0.00|0.00|-25|22|-26|0|0.043|8.368 GT:AD:DP:GQ:PL 0/1:120,139:259:99:3486,0,3208
It seems to be annotating the score from a different nucleotide change at this position (G>C instead of G>T) within the revel_b37_no_transcripts.tsv.gz file:
2 73717567 73490440 G C R S 0.043
2 73717567 73490440 G T R S 0.039
How I modified the revel_b37.tsv.gz file in case it's useful:
$ zcat revel_b37.tsv.gz | cut -d$'\t' -f9 --complement > revel_b37_no_transcripts.tsv
$ bgzip revel_b37_no_transcripts.tsv
$ tabix -f -s 1 -b 2 -e 2 revel_b37_no_transcripts.tsv.gz
Is this an issue with how the REVEL.pm file matches variants in v109 or due to me modifying the REVEL file itself? Thanks again for your help.
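For anyone who prefers to check the column removal outside the shell, the effect of the `cut -f9 --complement` step above can be sketched in Python. This is only an illustration of the per-line transformation, not a replacement for the bgzip and tabix steps; the function name `drop_ninth_column` and the example row layout (tab-separated, transcript ids in column 9) are assumptions based on the rows shown in this thread.

```python
def drop_ninth_column(line: str) -> str:
    """Remove the ninth tab-separated field from a line, keeping all others.

    Hypothetical helper mirroring `cut -d$'\\t' -f9 --complement` for one line.
    """
    fields = line.rstrip("\n").split("\t")
    return "\t".join(fields[:8] + fields[9:])


# Example row assembled from the zgrep output earlier in this thread.
original_row = "\t".join(
    ["1", "66058513", "65592830", "A", "G", "Q", "R", "0.034",
     "ENST00000344610;ENST00000349533;ENST00000371060;ENST00000371059;ENST00000371058"]
)
stripped_row = drop_ninth_column(original_row)
```

After the call, `stripped_row` has eight fields and ends with the REVEL score, matching the layout of the modified file shown above.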
Thank you for reporting this issue! I don't see any problem with the way you modified the file; the problem is in the plugin. REVEL matches by peptide change, which is the same for both rows in your example: G>C and G>T both give the peptide change R>S. We are going to include a fix for this issue too.
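The difference between matching on the peptide change alone and matching on the nucleotide change as well can be sketched as follows. This is a simplified Python illustration, not the plugin's Perl implementation; the function names are hypothetical, and returning the first matching row is an assumption chosen because it reproduces the score reported above.

```python
# The two rows at 2:73717567 from revel_b37_no_transcripts.tsv.gz, as quoted in this thread.
ROWS = [
    {"ref": "G", "alt": "C", "aaref": "R", "aaalt": "S", "score": 0.043},
    {"ref": "G", "alt": "T", "aaref": "R", "aaalt": "S", "score": 0.039},
]


def score_by_peptide(rows, aaref, aaalt):
    """Return the score of the first row with the given amino acid change."""
    for row in rows:
        if row["aaref"] == aaref and row["aaalt"] == aaalt:
            return row["score"]
    return None


def score_by_allele_and_peptide(rows, ref, alt, aaref, aaalt):
    """Return the score of the first row matching both the nucleotide
    change and the amino acid change."""
    for row in rows:
        if (row["ref"], row["alt"], row["aaref"], row["aaalt"]) == (ref, alt, aaref, aaalt):
            return row["score"]
    return None


# The variant in the VCF is G>T with peptide change R>S.
peptide_only = score_by_peptide(ROWS, "R", "S")
allele_aware = score_by_allele_and_peptide(ROWS, "G", "T", "R", "S")
```

Here `peptide_only` is 0.043 (the G>C row, which is the score that was annotated), while `allele_aware` is 0.039 (the G>T row, which is the expected score).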
We have fixed the plugin to:
add new option to not match by transcript id
The fix will be available in the next release scheduled for summer. I'm going to close this issue but feel free to re-open it if you have more questions.
Best wishes, Diana
Describe the issue
Hi, we are currently using VEP version 105 and are trying to update to version 109 (VEP Docker v109.3 + 109 resources). However, when running VEP 109, the annotated VCF is produced but the REVEL field is empty, even though CADD and SpliceAI scores are annotated as expected and no errors are produced.
Additional information
System
Full VEP command line
Full command being run:
Example record in output VCF:
We have also tried to annotate with only REVEL and again, there are no errors but the REVEL scores are empty:
and using the /data folder structure for REVEL:
Full error message
Any help would be appreciated!
Thanks, Becky