Open bartgrantham opened 2 weeks ago
Hi @bartgrantham, Thanks for explaining the issue so clearly, it really helps in understanding the problem. I've been able to reproduce the issue, and we're working on a fix. I'll let you know when we have updates.
I just wanted to let you know that this issue is specific to one of the RefSeq transcripts overlaping your variant. For now, a workaround is to run vep with only Ensembl transcripts.
Very interesting. FWIW, once I excised that single position from our data I was able to annotate the remaining 50M+ positions.
Out of curiosity, is it known what exactly it is about the RefSeq transcript that triggers this bug for this one position? It's surprising that it was a single position out of tens of millions.
For the transcript XM_040697338, the peptide sequence calculated here is incomplete. This causes a problem for this variant located at the end of the translation sequence.
Describe the issue
I am getting the following error and I've narrowed it down to a single line VCF:
System
I'm using the official VEP docker image id
607ee83f9536
(Ubuntu 22.04.4), containing the following versions:Full VEP command line
I was able to recreate from a completely clean install with the following on Debian 12:
Full error message
Data files (if applicable)
This single-line VCF triggers the bug, it was narrowed down from a much (much) larger VCF. The original had the usual headers one might expect, they are not needed to trigger the error.
tmp.vcf.gz