Closed lydiayliu closed 2 years ago
I don't think this is fixed for me. Using #369 be6358b82c5cdf5e2768f2b827a52cda6d54f462
Are you seeing the same error?
yes
[ 2022-01-26 16:55:39 ] moPepGen callVariant started
[ 2022-01-26 16:57:05 ] Reference indices loaded.
[ 2022-01-26 16:58:30 ] Variants sorted
[ 2022-01-26 17:08:02 ] Exception raised from fusion FUSION-ENSG00000002822:20032-ENSG00000151229:57879
An error has occured during the function execution
Traceback (most recent call last):
File "/usr/local/lib/python3.8/site-packages/ppft/__main__.py", line 111, in run
__result = __f(*__args)
File "/usr/local/lib/python3.8/site-packages/moPepGen/cli/call_variant_peptide.py", line 203, in wrapper
return call_variant_peptides_wrapper(*dispatch)
File "/usr/local/lib/python3.8/site-packages/moPepGen/cli/call_variant_peptide.py", line 172, in call_variant_peptides_wrapper
_peptides = call_peptide_fusion(
File "/usr/local/lib/python3.8/site-packages/moPepGen/cli/call_variant_peptide.py", line 384, in call_peptide_fusion
return pgraph.call_variant_peptides(miscleavage=miscleavage)
File "/usr/local/lib/python3.8/site-packages/moPepGen/svgraph/PeptideVariantGraph.py", line 679, in call_variant_peptides
self.call_and_stage_known_orf(
File "/usr/local/lib/python3.8/site-packages/moPepGen/svgraph/PeptideVariantGraph.py", line 702, in call_and_stage_known_orf
self.call_and_stage_known_orf_in_cds(
File "/usr/local/lib/python3.8/site-packages/moPepGen/svgraph/PeptideVariantGraph.py", line 731, in call_and_stage_known_orf_in_cds
traversal.pool.add_miscleaved_sequences(
File "/usr/local/lib/python3.8/site-packages/moPepGen/svgraph/VariantPeptideDict.py", line 181, in add_miscleaved_sequences
raise ValueError('Invalid amino acid symbol found in the sequence.')
ValueError: Invalid amino acid symbol found in the sequence.
I'm also seeing it but only when running the full GVF files, not the extracted GVFs. Still trying to figure out why
There was a bug in downsampleReference that when downsampling multiple genes/transcripts, the order of genes/transcripts in the downsampled genome was not made consistent with annotation, so causing transcript sequences incorrect. I can not reproduce the issue in downsample GVFs.
This sample is clear!
This is just a gift that keeps on giving XD