splaisan opened this issue 11 months ago (status: Open)
Hi @splaisan,
It looks like you hit this ProtHint error: https://github.com/gatech-genemark/ProtHint/issues/14.
A quick fix is to remove the protein in prot_223256 from the input protein set.
I will look into patching ProtHint to fix this, but that may take a while.
Tomas
Hi @tomasbruna,
I found the suspect file in the local spaln subfolder of my output folder and can easily remove it, but how do I restart the docker run without recreating it?
cat nuc_223256
>6434_g
CGGCAGGTCCCAAGGAATCGGCAGCCCTGGCAGCTGACATCCTAGCAGCGGGCGGCTCTTACGTGGAGGCCCCGGTGCTGGGCAGCCAGCCTGAGGCGGAGAAGGGCACCCTGCTGGTGATGGTGGGCGCGGAGGCCGACCCCCGGGAGCCCGGCAGCCCGCACCACGACACCGTGTGGCCGCTGCTGCGCGCGCTGGGCCAGGAGTCCAACATCCACTTCATCGGGCCGGTGGGCACGGGCGCGGCGGTCAAGCTGGCGCTCAACCAGCTCATTGCATCGCTCACGGTGAGAGGAAGGGTAGGAGGGGGGAAGATAAGGAGGAGCCGAACAGTTGGGCGCTTGGAGGTTTGGGGATTATGGCCAGGCTGAAACGGGGTTGCTGTTTGTGTCGATGCCCCGGTTGCCCGACTCCTGCCTCTGCGCCCCCTGCCGCCTTGTCCCACCTACCCTTGCAGGTGGGCTTCTCCACCAGCCTGGGCCTGGTGCAGCGCAGTGGCGCTGACGTGGACAAGTTCATGAGCATCCTGCGCGCCTCCGCACTGTACGCACCCACCTACGACAAAAAGCTGCAAAAGATGCTGGACCGGGACTACGGCGCCGCAAACTTCCCCACAAAGGTGTGTGGACAACGAGACGCGCAGAGGCATGCAACCTGTCGAGCTCTTATGTCCACGCATTGTAAACTTAGCCAGCGACATCAACTGCGGATAAGCTCAACCGTGCCCGCACCTTGGTCCCCTCGTCTCCCCACCCGCAGCACCTGCTGAAGGACGTGCGTCTGTTTGAGATGGAGGCTGCGGCTGCGGGTCTGGACACGCGCCTGCTGGCGGCGCTGAAGGGCGTGGTGCAGGACACTGTGGACCGCGGCCTGGCCAACACCGACTACTCGGCGGTGTTCGACGCAGTGGCGCACCCGGGAGAGCAGCAGGCGACCAAGCCGCAGCAGTAAAGGAGAAGGCTAGGTGTGCGGTATCTGAGCTGGCGCCTGGGCGCATTGCGGCATGCGCCCAGGCTACCGGAGCACAGCAGAAGTCGCGCGGGAGGTGCACGAGCAGAGCACAGGGCGCATGACACGAGTAGATTGAGCAACGCAGCGTCGAATGATATATGTGCGCGTGGGCGGCGGCGACGGTGGCGGTAGGGTTGGCTCCGTGCCTGCTGCTATGTGCTTACTTTGCCCGCATCAGGCACAAGGGTAGATACGATACACCACCTCGTTGCGATGCAATATGCGCATGGGGCTGCCGCCCTGCAATACGGGTGGGAGATCAAGGGTCATCACTCATCAGAGTGCGTAAGCACGCTGAGAAGGCCAGAAGGGGCATGCACTCCAGAAACGCTTTGCTGGGGCGCGTCAGGCAGCATGTCACATGCCTCGCCGGAACGTGGGCTGCTGGGGCTCTGAGTCGAGGCGAGTTTTGCGACAGCAGTAACCTAGGCTAACCTCTATCAAAGGTGCCCGCAGTGCAGTGGCGGCGTAGTAACGTGGGTAGCGTGCGTGGGTAGTAAGCACACGGTTCTTCACTCCTGGGCGGTTTTGTAGCATTGAACTGTCCAGTGCAGGCTTTGAGTCCGCAGCAAATTTCCAAATGATAAGCTGCGTTTTCCGCGACGAGCTTCTGCAACTTGTGAGTGGTCCTTTGAGTGGTACTTCCTCGATCAAGGCATAACCTGGATAGCAAAACGCTACCAAGGTGCTTTCAAATATAAAGAACCACCTCATACGAAAACGCCGGCAGTCCCAAGAATTCATGGCCGCGAAACCACAATGCGTGCATGACACATACATCCCCCGCTCTTCGTTCATTTTTTCGACCTAAGAACACTGGAGTTCCAGTCAATAACGATTCAGAGTTCAACGCATGCACAAGATTTCGCCATGCACAAAGCCAGCTAACGCTGTTCTGCGCTTCTAATTACAATGTGTCGTCACTGACTGCTATCGAGGGACTCGCTGTTGCGTTTTTTACAAGGAATAATCTGCTTGAGTCGGACGCA
ATGAAGGAGGTGCGTTGGGGAGCAAAGGTGGGGGCGTTTTCGACAGAAGGGTCCAGGGCCAAGGCGTCGACCTCCCCGTCAGGTTTCATTCTAGATTTGCCAAATATTGCCAAATATGCCAAATATTGCCAAATATTATGATAATGATATTTGCATCGCACCTGTGCTAAGCGCGTATTGACGAGCGTGGGCGAAGCGTTTGTCAGCGGCACCTATGCACAGACCCGGCGTGCATACATTTGCAATAGGACTGCTTATCATCTAGATAAACATTTCCACCCACGGGTGTCACCGAGGATGGCCCTCGTCGCTTTTGCTTGTCGCCAGCCCTGCCGGGCTGCTTGCGCTGCTGTGGCCTTTCCCTGGCCACACTCTATATTGTGTTCCAAGCATATCTTGCATTGCGAACACCAGTTGAGAAACTTGCCGAGCCCGCTGTCAACACCCGCTCAACTGCCCCAAACTTTACGCGACCGCGACAAGCACTGAAGATTAAAGACCACGCTGTGGAGAAGTCCTCCGGGGCTTCAAAATATGGCGCCGTGACATGCTGGTTTCTGTTCGGGCGTCCGCCCAATTCAGCCCTTGAGGTACACTCACGGCTCTGCCTGCGAATACCCATCCACACAGAGGTACGCGGTCATCGCTCCGCTGGGTTCCGGCGCCTATGGCTGCGTGTATAAGGTATGTGGAACAGGCTGGCAGCAGTGTGAAGCGGGGACCTTGGGAACAGCGGTGCCCCGTGGGAATTGGCGGTGTGACCTTGGGTGCGACGGGACAGGGTAAGAGGCAGGTTGGCGTAGCCCCGTGGCTGATATAGCTGGGTCCCGAGAAACAAGTTACGCCCAACCCAGGCGCCAATGAGACATGGAAATACGTCGCCTCCGTGAGAATCGTGGGGTGAGAGACACACTCATGAACACGCCTCCCCTCCTCTATCCCTGTAGTGCCTGGATCGCGACACGGGCAGCCTGTGTGCGCTCAAGGTCATCAACCTCGCACATCAGGAGCCCGCGGTGAGTTCCAAGCACCACAGCTGCACCAGTCAGTTCTGAGTGCGGGGCCACGCGGCTGGCCCAGCTGCCCAGCATCGCAGAGGCGTGTATGGCATCAGTATCCTTGGTCACCGGCATTCCTGCGATACAGCGTTAAACTCCCCATCACGTGTTTACGCTGGTGTCGAAACTGCTGACGTGCCTGTGTGGGTGGGCGCCGGTGCACAGGTCATGCGGCTTACCATGCGCGAGGTACGCACGCTGCAAAAGCTGCCAAAGCACCCGCACATTGTGGAGTTGAAGGATGCGTTCAAGAGCTCGGGCAGCGGCCGCGTGTTCCTGGTCTTCAGCTGCGAGGGGCGCAGCATGCATGAGGTGCGCGATCGCGGGGCAACGCGTGAAGGGCGGACGGGAACCTTCAGCTGTTGACTTCCCAAGGCCCCTCCAGGCTGCCCCTGTCAGCTTGACTTACTGACTGAGCTGTATGGTATGCCGCACACTCGCGCTCCGGCAGGAGGCGGAGAACTACGCCAAGTATATCCTGCCGGGGCCCATGCTGCGCCAGGTGGCGTGGCAGTTGCTGCAGGCGCTGGCGCACATACACGAACACCAGGTGCGTGTGTCCAACCGAGTATGTGCAAGGCGCGTTCGTGTGACTGGCGGTCTGTCGGGGCGCGTTGTCTCCAAGCCCGGGCGTATTTCAGAGTCCTGCTGACCGCGCGCCCACCACCGCCCACCACAACCCTCGCGCGTGCATCCGCACGCACCCAGATTATCCACCGTGACGTCAAGCCCGGCAACATCTTGCTGGTGGGCGACGGCACCGGCGGCGCGGCGGGCGTGGGCCTCAACGGCGCCGACGTGCACATCCGGCTGGCGGACTTTGGCTTTGCCCGCAGCTGGCAGCCGCACGAGGCGTTGTCCTCCTACGTGGCCACGCGGTGGTTCCGTGCGCCAGAGGTGGGTGCCGATTTCGGTTTTGATGGTTCTTGTCAAGGTGTGGCT
TGCTGGGGCGCATGAGGTGGTTCGGTGGCAGCTGATCCAGCGCGGTGGGCCGTGTCTCGGGTGGGTGAGCGTTGCACCTGCGGACACCGCACGCTAACCTCCGCACGCGGGGCCTGCGGTCGCAGATCCTGGTGCGTGGCAAGTACAGCTTCAACAGCGACTGCTGGAGCGTGGGCTGCACCATTGCCGAGTGAGTCGCGCGTTGGGGCTTGGGGCTTGGGGCTCGGGCCATGCACTGCCTTGTCGGCTGAGAGGGTACGGTATCCAATCGCGTAGCTGCGCGAGGGCGTGGCGGCACGGTCTCGAACCCACGCACGGCCACCGCACCACTGACGCCCGGACGCTCCCTTCCACTGCCTTCGTTACGAGCAGGCTGGCGGTGGGTTCGGCCCTGTTCCCTGGCACGTCCACCATCGACCAGCTGGCCCGGATCATGCGCGCCACGGGACCGCTGCCGCCCTCGTTAGCGGCGCAGATGATGTCGGACCGAACTCTGAGCCCGCTGGCGGCGCAGCAGCGGCGGCCGCCGAACCGCACCCTGCGCGAGCGCCTGCCGGTCGAGGCCCGACTGTTTGAGTTCCTGGCCGCCTGTCTTCAGGTGGACCCGGCCCGCCGGCCCAGCGCCAAGGAGCTGATGCAGATGCCGTACTTTTGGGACATCGTGCCGCGCAGCCGTGCCCTGCCCAAGGCCTCCATGGAGGCAATGGCGGCCGCACGTGACGCCGCCGCCGTGCAGATAGCGGCGGCTGAGGCTACCATCGCAAAGCCGGCGGCGCAGCCGGCGGCCGTGGCTGTGGCCGCGCCCGCGGCGGCGGCTCGCAAGGACGTCGTGCAGGTGGAGGCCAAGGGTGCGGCGGCCGCGCCGGCGGCATGCGGCGCGGTAGCGGGCGCAGCTGCCAAGTCCAGCGGCACGGACAAGGCGGCGGCCGGCGGTGCTGGGGGCCAGACGGCCTCGAGCAGCGTGGCGGCACCCATGACTACCACCCGTACTGCAAGTGAGGCCCAGGCCATGAGCCTCTCGGCCGTCGCTTGCTGCCCGGGGACTGACCGCGCGTCGACAGCGGTGCCCCCTACGGCGCCGGCGCAGCTGGCCGCTGCACCTGCTCAGGGCACAGCAGCAGGGCTCAAGCCTGCAACCAGCGTGGTGATCTCGGTGAAGGCAACTGCTGCGTGCGGCCGGGACCAGCCAAGCGCGCCGATGACTGGCTCGAGCCTCAGCACCCGCGACCTCGCGAGCATGAATCCCGCCGCTATGCCAGCGCCTGCCAACTCACAGGGCAGTGGTGTGACATCGGTGCCAGCATCTCAGGCAGCGGAGCAGGCGGCCGCCGCTCCCTCAGCGGAGCCGCCCCCACGCGTTGTGTGTATGCCCGACCTCACCAGCGTGAGCACCTTGGCATCAGGTGCGGCGGGGCCGCAGCCCGCGCAGCCGGCGCGGGCGCGCGCGCCGGCACCGGTGGCGGACGCGTCGCCGGAGGACGCGTCGCCCCGGCAATCCAGGACTGAGCGCGAGCTGCAGCGGCCACAGGCGGCCGTCACGCTTGTCACTAGCTCTAGCCTGTTCCCATCACCGCTGCCAGCACCGCTGCCTCCGCCGCAACCAGTTGCAGTGGAGGCGTCGTCGCCGTTCACGCTCGTGGTTGCTGACACTCTGGGTGGCGCTGCGGCGGGTGCCGCAGGCGCCGCCGCCCCAGGCGTCGCAGGCGCAGTCGGCGGTGACAGCACGCCGCGCAGCCACACCACAGCGCGCATGCTGGACCTGCCCTCCAATACCGTGGAAATGTTCATATCGCCCACCACGTCGGTGGCAATGCATCGGCTGCTGCCAGCTGTGATGACGCCAGTAGGCGCACCGCCGCCGGCCACGCCCAGTGCCGCCGTGCGCTTGCGGCAGCTGATGCCGCACTGCCGTGCGCCGGCGGGCGCGGTGCCGCCGGTCCTGACCTACGGGATGCTGTCACGCAGCAGCACTCTGGAGCTGGACATGACGGGCAGTGCGG
CTGCGGCTGCAGCGGTGGCGGCCGCTGGCGTTTGGGGAGGAGATGGAGGAGCGAGCGGGGATGGTTATGGCGTGTCGTTGGCGAATGGGGCCTCAGCGGGGCAGCTGCAGGCCCACATACAGATGCAGCAGCAGGCGGCGCAGCGGCATGCCCCGGCGGCGGCCGCGAACAGGGCGTGGCGGCGGGCGGGTCGCGCGTCGGTGGAGTTTGCAGACCAGCTGTCATGGCCGGCAAATACCAACCAGCCCGACCAAACGGTCAGCGGCGCCAGCACAAGCAGCAACATTTGGGCCAGGGCTGTCACTCCTGGAGCCGGCGCCGCGCGCGTTGGCGGCAGCGGCGGCGCAGCCGCCACTGGCACCCGAAATGTCACTTCCGCCGCCATTATGCGTCGCAGCTGGCGGCTGCTGCCGTACCGCACAACGGGCGGCAGCCCCGGCTTCATGCCCGTGCCCACGCTGGGCGACGAGCCAGCTGCGGATACGCCGTCCCTGCACACGTCAGGCGCGGGCGCCGTCGCGTCGTTGGTCAATGCTGCCGCGGGCCTGGGCCGCCACAACAGCCGCTCGCAGGCGTCCTTTGTCCGCAGCATGTCGCGGATGTCGCAGTGCCACGCGATGCCCTCGGGCGCCCTGGACGTGTCATCTGCGGGCCATGACAGCTCAGTGGACGGCGCCGGCGGCTTTTGCTCCGCGTACGCAATGGCGAACGCATCGGCTGGAGCGACATCCTCGCCACTTGTGGGCCTGGTGACCACGCCGCAGCAGCCGGCTAAGGCGCAGCAGCTGCAGGCACAGCTGCAGAGAAACGGGTCCACAGTCGGCGGCGCCGTCGCACAGTCGCCGCCCATGCTTTACGGCCTGGTGCTGGCCGCAAGCAGCGATTCGCCGTCCCGCACGCGCCGCGCCGCGAGCGCCGTGCTGCCCAGCTTTCCTGCAGCCAGCGTACCGGGAGCCCACGCCACCCCTGTCACGTACACTGGTGCCAGCGCCGCTGATGCCAGCAGCAAAGGCCCCGCCAGCGTGGCTGCAGCAGCAATGGCCCTTCTCCTGCGGTCGTCGTCCCAGCAGCAGCGTGCTGCCACCGCCGCAGGGCATGTGCCGCACGGCACAAGCCGCCTCGCAAACGCCGTCAGCTCCAACCTGTGCGATTACCCCTCTGGGGACGCGGACATCGCGCCCACCGCCGGCACCCCTCAGGCGGGAGCCTCCGCCTCGGCCTTTCCCAGCGGCACGCCGATGGGCACAGCGACCGACTCGGGCGCCGTGCGTCGTGCACTCGGCTTGTCCTGGCAGGTGCTCCAAGCCGTGGGCTGCAGCAGCAACGCCGCGGCAGCTGCGTCCACGGCCTGCTTCGACAGCGCCGCCTCCGCCACCGTCGCAATGGCACAGGCCGGCGCCGTGTCGCTTGACGCAATGCTGGCTACTGGAGGCGGCGATGGCGGCGGCGCCCCTGCAGATTGCGGCCTTACCGCTTCGGCGTCGGCAGTGGCACGCTTCCCCAGCGCTAGCCTGCTCACGGCGGGCGGCGGGGCCGCCAATGGTGCCTACGTGCCCCACGCGATTACCGAGGAAGAGAACGAGCTAGCATACGCGGCAGCAGCGGATGCGTCAGCTGCCGGTGAAGCTATGGGCGCGGGGTGCAGAGCCAAACATGTGCTGGACAACTCGGATGGATGCGTGCGTCTGGCTGGCTCAAAGGACACGGCAGCGGGCATGGCGCACCTGCAGCAGTCTGCCACCACGCAGCATCCCTTGCCTGCGCGCACGGCATCCCCGGGTGGACGCCGCCAGGGCGCACATGACAGCAAGCAGCGGCCAGGGCTGCTTGCCCGTCTCTTCGGCTGTGGCCGCTTTCGCAATGACCAAATTTGAAGCAACATGTGAGACAGGCCGCCGCTTGTCGGATGGTACGTGTTGGCAGATTTGACACGGCGTCGGGCCTCGGGCCCCGGTTGGGGATAGCAGTGTGTTTTTGGGTGTGGCGGGCCGGACCCACTT
GACCGTACGGTAATGCTTAGGTACGGAGCTCAGGGTTCAGGCTGTGCACTTGTTTCTTCTTCTGATATGAATGCGACATTGCATATGCAATGAGGTACAATGATATACTGGGTATTTGCTTTGCCTTGGACGTGAATGCAGTAGCCGGACATGGAACATGGGTTTCGACATGACCGTGTGTGTTCGCGGTAGAGTTGTGCACACACACCAGGCTTGCCTAAGGGTGGGCATGGGGATACTTCAATTAACGAAGGTCACGTTTTAGGAGTGTTTTTGGGCGGAGCGGGAGATGAGGTAGACGCTTGCGGCCCCAGACGGGAGGCGTCAACTATCAAGTTGATCCCATTTATTCCATATGAACATGGCTGTAATGATGCGGCCCGTGGAAGTGTGAATGGGGGGCTGTTCCATGGATGGGTGAGTTTAAATGTTCCCGGTCGCAGTGGGCTCTCGTGCAACCAGGTCCGGATTTTGCGCGGTATGGCTAATTGGTCGTGCCGACGTGAACAGGGGCAGCAGTACGTACTGTCCGTTTGTTGCATTAGCATTCATGATTAGGGGAGACCGCAGCATTTTAGCCCTGGGGCTAAGGTTGTTGAGAAAGAGCACCAGAGCATATGGAGATGTCGCTGTACTTCGGACGAGTACGCCTGGAGGCTGAAAGGAACCTTGCTGCGGTTTGTACGACGCAGACAGATGCTCGCACGGTCTTGCAATGCAAGATGACGGTCGAGTCGTATACGTGCCATGATGATGTTGTTTAATGCTTCACCAGTTGACCGATTATCGCTGATGGGCGCTACAGACAGGGAATGTCCTAACATGGACAGCTGCGAGCAGCTCATTGCGCTGGAGTGTGAATGGAGCCAGAGAAGTCTGAGCAGCCTTGCAAATGGAGATGCGCAGTATGCTTGGTGAAGGAGCTAAGCCCTGCATCAAAGGCCGGAGATATTTGGGGTACATGACGCAGGTCACGAGCTTGCGTGCAACCACAAGTGTGGTCGTCCAGCTTTAGATCTGGGGGGCGTGCCAACAGTGACCCCCACGCACGTTGGCCGGAACGTGTGTGTGGGGGGGCTCGGGTTCTAGTTGGCAATGGGTGCAGGCGGTGCGGTCTGTGCGAGGCGGGAATCTTTTACAGTTTGCCCAGGGGCGGCAGCCGCTGCAGTGTTGGCTTGAGAAGCAATGTCTTAGGCATGAGATGGGAAGGGAACATGGGCAGGGAGCTTCGTGACGTGGGGCCGAGTGAAGGACGTACTCTGTGGAGTCTGCGCCTTGGGCTGGTATGCGCTGCTCCCATGAAAGCGCCACAATATGCCATGGGATTTTTGTCTGATGCCTACCAGTAATCATCTATCAAGTTGGGACCTGTACGTCATCTTCTTCCGTCGCTTGGTTGCCTCCATCTGCAGGTGAGCGGCCAAGCACAGCCAGTCACAGCTAGTTGCTAGCGTACACGTTCCAAACACTATCCTACCAGCTGTTGTCCATGCAGCCGTCCGCTGCTGTTGCCTGGCGGAGTGCTGGTGCACCCTGAGGTGTGACCTCCACCTGACCCCTCATCGTACAGCTACCAGTCTCAACCCGCGTGCCGGTGCCACTTTCCCGATGGCACCACAGCGCACCACGCTCTCCTCCTCGTGTCCCTGCCACGGCTGCCAGCAGGGATGCCAGCCCATGGCTGCCAGCCTGCGCAATTTGATGACTGACTGCTCCCTCCGCCCCGCAAATTCGGGTACCTTCACTGGAAAGTGCGTTGCAGTCCCCAGTCGCAGCAGTGCGAGGGTGCCCAAGGTTGCCCATATTGCAGTGCCATGCTTTGGGCTTACTTGAGCCATCATTTCCGGACCATGCTGCACCTGGCTGCATGCATC
cat prot_223256
>3055_0:000816
MENYEYLGDLGSGSYGFVWKCVQRSTGRVVAVKGFKLAHTDKKFLDAAIREVRMLRNATDHPNIIQLLEAFRSSTGRVYMVFEFADKCLSAELHKRFTCGLPAGQTRVVLWQVLAAVAHLHSKKIIHRDIKPGNILMTSDGVVKLCDFGFARLTRGDPYQPDRFSSYVVTRWYRSPEMLVSDLYGAPSDIWSLGCTFAELATGRPLFPGASSLDQLWRIMRCMGPLPPTQAERFAAAATAAGLPEAPPPPPRGKSLWQRLPELDSRLLDLVQACVRLDPAQRPTAVQLMQMPYFHEIPKAIAGSRLEQLYLAIGSGTGYPGSALGRTASARFRQMQQLAAQQKAAGAAAGGAGSGTQPNVASVPAGGSAGVRGLGGSVTVSVMSPEELLASPRGGHATSGSVKRPASVLLSSVAEAVLGEKPSAGDGSGDCSIFPLAPPLPHIPMVDIAMLLSAQQQQQVQPQHQLQQAPLQGSQRYAAASAAAVVPLAATAAAGPSSSRLHSVSSPFKTVPMLPPLQPAPTSGDVVMPAAAAPIIAAAAASAAMSQSPRSSASMSSPSPHPPGTRRQLSGTSPRGAAPAGTASGRNLLAAATAGAGAAAGRQASGRGLPMGGLVGGVAAPESTGGGSSPTAAGVAVAVPPSVRLAHLSSLSPRQRQHLPQLSPLQRQQQSQALPAAATSVAMPPSAFLDAEARGDSLGSGSGGDGEETDDEILAARQGCRRNRQGYERDGSASRLGRNAGGAVPAGAAAMATATGGAAAAAALPPASASIMPVEAHAMPGLGLLEGYDAQDTSDDDEAQVSDDDELMAFYVARKSGGRGRRGGAAGTRATGSRRKVASAAAASTGALTTPAPAAASAAAMSASGAMHGAAAPAAAATKAAAAATRDELIGVALQAAAAVDMATQEMHMAGSTGGGMQPMQMEADAGMSLHVAAATTAAPLRGAHHNGVAAVDAAAPSPALASWPTAAAPAAGIIAASGLGPRAVAAQPPQRPLPHAGIHQQHHGLYGTQGSHHRQTMPRTTGGGGSSRGSTGTGATPVAAGLNRRVTAMVLGTGLEDAVSHASAAANPNTAATGTPASAAVAAASAPAQPRPLAPAALASCSPTPAVTITSAAATPVVAPLPPPPRFPTGAVAKRATVASYLAISQPNGSMAVTSASVLASGTSATVATADAAVAASSGTTVSQPLPVPRSVARGGQGAGMSGGIIVGTDTGGTGPVAGAVRGAATATGLTHMGTGSLPTVGSIGPGLRHHNHATTMGLTLMAPHESGPRGLGGGAAVTPSAAGVHLQGHGPASLPYGRASLPVQGGSYVGFSTGSANRRMLSRQGSTVFNQLMYDALPEIGTPGGAPDVPAGTPPPQRRRAVMSGFTPCRTAAARAAAEGLPAAAAMAAAMGSNTTDLSVAFSPIAVARHEDPLSIGDGHGLERSSVGAAPGFRSVQFGLACGAGGAYPGASAAGGAHRRQASMQMQTAYTASVGIMGAAGSDLGPSAATAIPGGGAAGGRGSGSYHSHASDTGMLMGSSAPVSHAMHPGYGSGSGSMGGSYRWPGQRILVPDQAHGLATATVTAASGPAGGPPVRGGRLPQAVGLAASGSSQQTSGSAASGAGPLGSGTTVGAAAGAHAAAATPGRSRLGSGILGRMSDDPAGGSMLGAGVGAGAGGGGSHGQHPVLVCTADDVHCSSALNIELDGSCSVGNNTGGGNSAGMWGFGPMAGYPAGAGAASGAVIAARAGGGGRSRWLGSGVIDSLPEDREVLHVAGVDDWRLGNSPGIAGGAGSGVGMAELVLGASDHYSSGLPPAPTSGPTLAEVSAAVAGAILAPSSSSAMGFGYKLSPRGQPATIPGQAGLMGLRPKSPAGSLELLRGRTNGHAGQASYGHGPSGLHQAGGALGSPSSPRSPGSGDAPGRPGSAQLPLAGDGSGMRFAANGSPSRAWVTEGCAAGGGTIGAAAVADVGAAAGAAGKLASADKAEKSKWPRAKALLGGKLISSLVKKFKDGVQVSD
RK
Thanks in advance,
Stephane
Never mind, I figured out that this protein does come from the Viridiplantae.fa input after all.
Here is a bioawk command to remove it, for others having the same issue:
mybad="3055_0:000816"
bioawk -c fastx -v header="${mybad}" '{if ($name != header) print ">"$name"\n"$seq}' ../Viridiplantae.fa > ../Viridiplantae_edited.fa
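For anyone without bioawk installed, the same filtering can be sketched in plain Python. This is only an illustration, not part of BRAKER or ProtHint: the function name `drop_fasta_records` is made up here, and it assumes the record ID is the first whitespace-delimited token after `>` (which is what bioawk exposes as `$name`). Unlike the one-liner above, it keeps header descriptions and the original line wrapping, and it accepts several IDs at once.

```python
def drop_fasta_records(src, dst, bad_ids):
    """Copy FASTA records from src to dst, skipping records whose ID is in bad_ids.

    The ID is taken as the first whitespace-delimited token after '>'.
    Returns the number of records that were dropped.
    (Hypothetical helper for illustration; not a BRAKER/ProtHint function.)
    """
    bad = set(bad_ids)
    dropped = 0
    keep = True  # lines before the first header, if any, are copied unchanged
    with open(src) as fin, open(dst, "w") as fout:
        for line in fin:
            if line.startswith(">"):
                fields = line[1:].split()
                name = fields[0] if fields else ""
                keep = name not in bad
                if not keep:
                    dropped += 1
            if keep:
                fout.write(line)
    return dropped
```

With the paths from the command above, the call would be `drop_fasta_records("../Viridiplantae.fa", "../Viridiplantae_edited.fa", ["3055_0:000816"])`. A return value of 0 means the ID was not found in the input.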
The same problem occurred with the sequence below (it has been stuck for 14 h 30 min on a 5975wx system). I hope this helps solve the problem.
Command:
braker.pl \
--genome=../202_repeat_mask/${ID}_masked_nuc.fasta \
--species=${SPECIES} \
--prot_seq=/data/genome/db/orthodb/odb11v0_all_fasta_no_asterisks.tab \
--GENEMARK_PATH=${PATH_BIN}/GeneMark-ETP/bin \
--PROTHINT_PATH=${PATH_BIN}/ProtHint/bin \
--AUGUSTUS_CONFIG_PATH=${PATH_ENV}/braker3/config \
--AUGUSTUS_BIN_PATH=${PATH_ENV}/braker3/bin \
--AUGUSTUS_SCRIPTS_PATH=${PATH_ENV}/braker3/bin \
--fungus \
--threads=${THREAD} \
--softmasking \
--useexisting
>2090_g
CCTTTGGTCGATCCGGATTTGCAATTAAAAGTTCTGACTCGAATATTACGCAGCGAGTAGCAGCAGGCTTGCGTGGACACTTCTTGCGCAGCAGGAGAGTGAAAGAAATGATTACATCCCAGAACCGTCGAGTTCGGACTCTGACGCGTGCTTGTAATCAACGCATACACACATCCTTCTTAATAATACACAGCGTACACTACACATACTTACAAAGAATTAGTCGAGAGGTGCATTACCAGGGCCTCTGGATGTCCGCCTTCCGAACTGACTCCGTGCAAAGGCCGAGCTGGAGTTTCGTTTCATCTTGAGTGAATATGCTTTCGCTTTCGAAGGGCAAAAATAAAGCGTCACGTACAATCGAGAACCCGTGCCACGGATGTTTGATAACAACGCGCTCTCCAACGCGGAAGCAGACAAGCTTGTAGACTCTGCTGGTGTTCTTCAGAATACGACACGTCAGCTAGACTTGTAAGCACGTACCACTGATGTCACTGAAGCTCGATGCACTGGATCCATGCGTGCTGCTCCTGTCGCTAGTATGCGCCATCCCAACCGCTATCGGTCTATACTGCATGTGTGGTAGACCCCTCGCCAGACCCGAGGACAGAGCAGGCGGGCTAGATGACTTAGTTTGTGTGAAATGTCGTGCTACAGGGGACTGCGTCTGGGATTCAGGAGGTGGAGATGGGATAGACGGTATGGAGCCCTGAGGAGAGCTCGTGTTCGAGTTGAAGTCGCTAGAGCTAAGGTCTCTCCTGGCATTCCCTGTATACGTCCGGCGCAATGGTGACGGAGAAGTGGGTGACGTGATGTCGCGTCCGCGCTGAGGCTGCTTGCCTTTCGACTTTGGGCGGCCATCTCGCACAAGCCCAAGAGCGTCACTTGTCAGTACAGACTGAAGATGCTGCAGTTTCTTATCGAGTGCCTCTTGCTCCTCCAAGCGGCGTTCCTCCTCCTCCGCTTTCTCGGCTTCTTCATCTATTTCAGAGTCCGAGTCTGAATGCGCCGGTGACGATGTCGACGTCTTTATTGGCGGTGGTACATGTCGAGTACCTTGCAGAGTGAGGACGGAGGATGAAGACGTCTTCTGCAGCTTGGATCCTGAATCATAGCCTAACGAGTTCAGTCGCGCACGAATGCCAAGGGGCGTATTCAATCGGCCAGATGAAAGCAACCTGGCACTTCCCGCCGTCCGACCAGACGTCCTTCTGACCAACTCTGGACGATCCTGGCCCTTGACGTCTACCTCAACGGGGCGTGCTGAAGGAGGACCCGTGGCAGGTGCGAAAGGAACTTGAAGATGTTGTATGCCCTTCAAGTCCTCTTCATAGCGTGTTTGTGCTCTGAATAGAAGATAGGGAAGAGGAACTGCTAAGTGTGCTGCGAGACCTTGCCCTTGAATGGTTCACATTGGCCTGTCAGTATCGAGAAATACCAATATAATCAATAATCCATCTCACAATCCGTTCCTCCGCTATCTGAAGCTCTTGAACGCGCAATGACCTCCCATAGGATGCTTTCCTTCTCAGCATTCCATTCTATCTAGTTTTCAATGAGTTGGGTGAACGCTATCCAAACATGCTTGTGAATGGACCTACTCGTGGTGGGTTCTCGTAGCCTTCTTGTGGTCGATTGTACGGTAGACGGATGATAATGCGCACGGATGGTATCGCCGATGAAGAAGGCATTGCTAGCATTTACATGCAGGCCTTTCGAATGCCAGTGCTAACAGTGAAAGTTGCGTAATGTTACAAGCCCCGTGTCTCCGTGACTCCGGATTGCCTCACCCCTCCACACCACTCACAAATTGTCTTATTCTGCCTATGCCTCGGGCCTCAGCGCCGATTTTGGTCAAAGTAATCCGGCTTGCCTCAGCCTTTCGACCAAAATTTCGTCCCAAATTATACGGTTTCCTCCCAGACACGTACTGGTGGAATCGCCGGCGGGGATCCTTCCACATCCCACGTCGATCGTTCAACATCCCAACACGAACCACC
ATGACCGTGAGCTCGACTACTGCAGATGAGGGCGAGGAAACCAAAAACGATGCTCAAGAGCTCGACGAACTACTGGGGAATATGGCTCTCGATCCTGAAAATGAACAGGTCGTTTCAGAAATTGGGGGTGGGCGGAGCTTTCTCTCAAGCGACTATCCCGTGCCAATACAAGTTCTTGTGATCTCCCAGTGGTGCGATCTATCTATGAAGGCAGCAATGATGCAATAAGAGTCTGGAATCCGAACAATTCTGAGAGTACGAGTGCGAGCAACAACTCCGATGGCAAGGTAACGCGCTTCGAGGTTCACTTAAGGTACGCTTAATATTGTTTTCGTTGTGAACGATTGACCAACGTGTATACAGCTTATCGTCCTCACAAGACGGTCCCGAGAGCCGCTTCAAGGTCCTTGTATCGCTGCCACGAACATATCCATCTTCATCTCCGCCACAAATTCAGCTCCTGTCGCGTTACATCGGCGCGTTCAGTGTTGACGCAGACCTCTTCGGAGCAGTCATTCGTACATTCATCTCATCTAGAGATGGCGTTGAATGGCTTCCAGGTACAGAATGCATCTTCGATGGATTAGAGAACATCCGGGAACGCGTTGCTAAGTGGTACGACGAACGCCTTAGTGAAGAAAAGGCTCTGGAACTCGTAAGAGACGACGGAAAGGAAGGGACGCACGAAGACAAGCATCCGACCGACGAGATTGAATCTGTGGACAAATCCTCCAAGGGTCGCAGACCACAGGCTTCTCTGCCCGAGGGCATTGTTCTCCATGTATCGGAGCCTATCGTTGATCGGAAAAGTGTTTTTATAGGACGGGCGTGCCGAATATCTCATCCGTCTGAAGTATGCTTCAAGCTCCTTTTCTCTTTCGCGTTAAATTTTAACTCATCTGGTGTTTGTCCAGGTTGACTCCGTTTTATCGTATCTCGTCGCGGACCGAAAAATAGCTCGCGCTACCCATCCGGTTATAAATGCCTGGAGATGCAAAGTGAACGGAACACTCCACCAAGGTAAGGCTTGTTGCGTTTGTATCTGTCTAGTCCAATGTTGCTTAGATTAATTCTTCCACTCAGACAACGATGACAATGGAGAAAACGCCGCGGGGAGCCGCTTGGCTCACTTACTACGAATTTTGGTAAATGTTGCTCCTATGGACGCAGTCATCAATTCGCTCACTACGCATTTTGAAGGACGTTGATAATGTCCTTGTGATCGTCACTAGATCCTTCGGTGGCATCCGTTTGGGCCCCGACCGTTTCAAGCATATTAACCAAGCTGCTCGCAATGCTTTGGAGATAGGAGGATTCTTAGACGCACCAGATGATAAGAAGAATACCTCAAGGCCGAAAAAAAGACACTAAGATCATAGAATGGCTTCAAATACAGTGAAATGGTCAATAAGTAGTACAATGTTCGCGATCGAGCTAGCTCTGAAGTACTGTTTGTCACGTGGCCTGCTTGGAGAAAGATCACTGCTTGGGGCTGTTACACTGACACTGCATTCCCAACTCATGACTCCGTTCTGTGCTGTGTATTGCCGTCCACGTCGAAGCCACAGCTGAGATCGCTGACTCAGTATGATATGTAGGCTGTGACAAGTGTATGCATCTGGTTTGCTTCTTGTACGCATAAAACATCTAACGTGGCCAGGCAAGACAAACAGGTCATGTTTAACAACTCTTACCGCGTTCATGAGCTAGCTGGCAAACTCGACAATACATATCAAGATTCTCGCGAACCTCTGGTCCAGAAAAGCCTGGGTTCTGAGCCATGCCTTTTGGCGAGGCTGCCTTCAACCAGTGAGTATTTTCTCAGAGACATTCAGACCTGCAGAACGACGTCTCACGTTGCGTATAATTACCAGGTTTCTTGGTGCGAGAAGGAGAAGCTCTTGCTCCGAGGGCAGTTCTCAAGCAGCTAAGACGTAGTCCGTCACCACCGACGTCCCCGACGGATACTGATGAAGCCGAGAGACAAGAGCCAAGTAAGT
CGAAGACACAGGAGACGGAAAGGAGATCATGGTTTTCTCCAAGAACTGTTAAACGTCCTCAGACCACGTGGAAAGAGCCTCAGGTTTGTACTTATGTGACTTCCGCAGAAGCAAGTGCCTAACAAACAGGGTGTTAATATAGTTATATGAGGTCTTTCGTGCAATTGAGCGGAAGGACATCATGTTTCTCATGGAGGTACGGGATCGAGCATTTCATGTAAGCACTTCTCAGGGCATTTTTATTTGCTATCTCTGACTTTATCGCAAGCTTTTACTCAAAAAGAGTGGGGATGCGACGCCACTCGTACACGCTATGCGGATAGGCGATTCACACCGTGACGTCGCAATTATCATTCTCGGTGCCTTGTCGCGATGGGTTAACCATTTGGAAGACAGCGACATGGCCGACAAGCGAACGAAACCGTTACTCAAAGCTTTGCGTGAGCCATCTCCACTTTACTTTTTACTGGTGAACATGTATATTCACAGACGAAGCAAGGCACCAATCTGAAACTCGCCGTTGACTATGGCCTGCAGCGCTCGCAATCGGACCTCATCCCTTCTTTCATGCAGACCCTGGTCATGAGTGAGGGTGAAAGATGGATTATCGATCAGACGCATAACGTGGCACTTGCACTTCGTGCTGGTACAGAAGGGAAACCTGTTCATACCGCTGAAACTGTTGTTAGGAAGTTCGCGACAAGGGAGCTTGGCAAGGCCGAGCTCATAGCGTCGTTAGAAGATTAGTAAGTGTTTAGCGCATTTATTCACGTTACCGACTCCTCAATGTTTGTCTGCAGCATAGCCAATGCCACTGCAGATCTGTTAGTCCTAGCCGCCTGCTCATGTGTCCTTGATTCTGTTCAAGCGGAACCTATTCCGGTGCGAGTCATTTATGACAGACTTCGGGTTTGATACTTACCATCGGCCAGACGTATTACTTTGCACGAGACACAAGAGTTTTCAATGCTTTCCAGGAACGTCTACAACATCACAAGGGGGCTCTGATGGGTCTTAGCAAACGCCTAAGGTGGCAGATTAGGGTCTTGGAGCACGTACTAGAAGGGCGGTTTAACTCATTCAGGGTATGCATTGCCTGTATTACAGATTATATGGGCTGATCCTTTCTTTTTTTTCTTTCCAGAAAAAGGTCGAGTTGTTGGCCTATGAGCTAGACGAGGGTCCAGGAGTATGATAATTCTGTGCTTTCAAGATCATAATTTTTTGGCCATTCTATGAGCAAGCTGACCGAGTTATTCTCCGTGCTGATTGATGTACACAGCACTAGTAAGAGTCTGAGAGCTCCCCAAAGAATTTTGCAATATCACGCTCGTATAAGGTGCCGCACGAGTAAAAGTTCTCAACTCGAAG
>93625_1:000408
MRRSPSPSTADADNTDYALELQDFLAELSQDPEREAVASEIQVLQSIYGDDAIRLWRPPLKNGKRSASTSRRDGTIRYEVLLSLSSPHDDVSLKVLVSLPETYPKSSPPQLQLLSKYIGSFGADANLFGSILRTYISVSGVEWLEDTVCVFDGLQNVLDRCVSWYEDRLSAEKAGELVRDDGKEAVAVSTRPVSPTGQTNAEISGIADSAPAPVPNALPIGIHIYVAEPITDRKSAFVGRACRIHHPSETRFMCAELFAFKVPLILSHLMSDRRISRAAHPIINAWRCQVDSVLHQGSSHNDDDGETAAGGRLAHLLQILEVNDVLVIVTRYFGGIHLGPDRFKHINQAARNALDLGGFLDAPENKKNTGRVKKH
Dear,
A very similar first run with docker succeeded (ONT assembly, everything else the same). The ONT run finished after a few hours and gave results.
The second run (PacBio assembly) hangs on some ProtHint record. I stopped the first attempt after it had hung for 2 days and restarted fresh in a new folder, and it hangs again at the same point.
Using teambraker/braker3:latest (v3.0.6).
Some of my terminal output (full logs attached):
braker_firstrun.log braker.log
The current command is:
Any idea what this could be and how to circumvent it? Here are the running jobs (with 100% CPU on one thread):
Thanks in advance