Closed errcricket closed 2 months ago
Hi @errcricket ,
Thank you for providing us the details for the issue. Can you please try using check_existing
alongside --fields
? This flag helps to identify known variants colocated with input variant. VEP's variant cache contains variants from dbSNP and other sources.
Thank you for the response. Can you first address this question please?
_When running VEP, if the --fields of interest are Consequence,IMPACT,Codons,Amino_acids,Gene,SYMBOL,Feature,EXON,PolyPhen,SIFT,Protein_position,BIOTYPE,AF,gnomAD_AF,CLIN_SIG,PUBMED, does that mean for example, for ClinVar, VEP annotates a provided vcf file with clinical significance information because (as implied in the info.txt file) the cache files already include ClinVar info?
Or do I (also) need to provide a ClinVar file that I have downloaded, and use the --custom flag?_
Hi @errcricket ,
Yes, VEP annotates the provided VCF file with clinical significance information from the cache with version mentioned in the info.txt
file.
While we try to include the most recent variant data in each Ensembl release, some projects release data more frequently than we do. If you wish to use the latest annotations from such projects, the data files can be used withcustom
flag.
Please let us know if you have any more questions.
Kind regards, Likhitha
Thank you @likhitha-surapaneni, I think I have what I need.
Greetings.
Trying to understand VEP a bit better. I am running VEP (version 112, perl 5.26.3) on a Linux server (CentOS) and have manually downloaded indexed cache GRCh38 from here. When I look under the untared
homo_sapiens/112_GRCh38/
directory in the info.txt file I see the following:When running VEP, if the
--fields
of interest areConsequence,IMPACT,Codons,Amino_acids,Gene,SYMBOL,Feature,EXON,PolyPhen,SIFT,Protein_position,BIOTYPE,AF,gnomAD_AF,CLIN_SIG,PUBMED
, does that mean for example, for ClinVar, VEP annotates a provided vcf file with clinical significance information because (as implied in the info.txt file) the cache files already include ClinVar info? Is that correct or do I (also) need to provide a ClinVar file that I have downloaded, and use the--custom
flag?The latter seems to be for a different file version than what comes in the cache, but I am not seeing clinical sig in the results -- so I don't know if I made a mistake or if it is because ClinVar does not have pathogenicity assertions for the provided variants. I have run VEP with and without the custom flag (both times with --fields list) and ran
diff
on the two outputs. There are differences, but they are likely related to # lines (finding differences from visual inspection was ineffective).For reference, this is the code I executed.
Thank you in advance!