Closed lydiayliu closed 3 years ago
CPCG0346.gencode.aa.tsv
[ 2021-07-13 20:07:16 ] moPepGen parseVEP started.
[ 2021-07-13 20:08:18 ] Indexed genome and annotation loaded.
Traceback (most recent call last):
File "/opt/conda/lib/python3.8/site-packages/moPepGen/parser/VEPParser.py", line 162, in convert_to_variant_record
return seqvar.VariantRecord(
File "/opt/conda/lib/python3.8/site-packages/moPepGen/seqvar/VariantRecord.py", line 62, in __init__
raise ValueError("Length of ref must match with location.")
ValueError: Length of ref must match with location.
The above exception was the direct cause of the following exception:
Traceback (most recent call last):
File "/opt/conda/bin/moPepGen", line 8, in <module>
sys.exit(main())
File "/opt/conda/lib/python3.8/site-packages/moPepGen/cli/__main__.py", line 350, in main
args.func(args)
File "/opt/conda/lib/python3.8/site-packages/moPepGen/cli/parse_vep.py", line 62, in parse_vep
record = record.convert_to_variant_record(transcript_seq)
File "/opt/conda/lib/python3.8/site-packages/moPepGen/parser/VEPParser.py", line 175, in convert_to_variant_record
raise ValueError(e.args[0] + f' [{self.feature}]') from e
ValueError: Length of ref must match with location. [ENST00000536027.1]
VEP entry
chr16_71922627_ATGCCT/- chr16:71922627-71922632 - ENSG00000182149.21 ENST00000536027.1 Transcript start_lost,inframe_deletion,NMD_transcript_variant 1-6 1-6 1-2 MP/- ATGCCT/- - IMPACT=HIGH;STRAND=1;SOURCE=GENCODEv34
That is an edge case that I didn't consider. Thanks for reporting it. I'll fix it once the rMATS parser branch is merge. Because some code in that branch could be useful.
Following code for launching MoPepGen:
Gives bug:
VEP entry producing the problem:
I believe this is an edge case due to the the deletion being at the very start of the transcript.
In VEPParser.py, line 132, this results in the
alt_start
being-1
.