Closed Lucas-Maciel closed 2 years ago
Hi Lucas, Thanks for reporting the error and sharing the logs, very helpful. At first glance it seems to be an error related to Pathway Tools, could you please tell us the version you use? In addition please share the versions of mpwt and metage2metabo you have installed. Cheers
Hi,
mpwt v0.6.0 pathway-tools 24.5 m2m 1.4.1
This issue has been fixed. Please remove empty migs file from the input directory.Please restart the pathway tools to load the latest patch.
Hi @akoth9 , how do I restart the pathway tools? I tried to run just the command pathway-tools and a few patches were downloaded. But I'm still not able to produce migs files.
04-Mar-2021 13:04:46 Fatal error: file #P"gbk/teste/migs.dat" does not exist: No such file or directory [errno=2].
Hi,
Thank you @akoth9 for the fix.
@Lucas-Maciel, it is strange I update Pathway Tools on my computer and it fixes the issue with migs.dat on your data.
Pathway Tools automatically downloads the last patch when you use the pathway-tools command. Just to be sure, can you use the following command:
pathway-tools -id
It will list all the patches associated to pathway-tools. And in the list, you have these lines:
...
**** The history for this executable is: ****
This image contains systems (Ptools), version (24.5 . 24.5)
loaded from /home/ratel/pathway-tools/aic-export/pathway-tools/ptools/24.5/
load began on Thu Jan 14, 2021 at 17:34:28 (successful completion assumed)
The following patch files are loaded:
patch-package-sri
p4513-operon
p4512-operon
p4511-migs-issue-batch-pathologic
p4510-operon
p4509-metacyc-pwy-org-crash
...
The p4511-migs-issue-batch-pathologic
is the patch fixing the migs.dat issue. Do you see it?
Also, in what environment is your Pathway Tools installed: on a personal computer, on a cluster? One possibility that comes into my mind is when using Pathway Tools with Singularity the patches are not correctly applied due to permission issue.
I was finally able to run, but now I got a new one, related to my input pathologic.log
Inference statistics:
Build failed: 1
Species: teste
------------ Species: teste
batch-pathologic: A fatal error occurred for gbk/teste/ version 1.0.
See pathologic log file gbk/teste/pathologic.log for more details.
04-Mar-2021 18:14:25 Fatal error: end of requested sequence 1178 exceeds length of cached chromosome 502
Evaluation stack:
->(TPL::ZOOM-COMMAND :FROM-READ-EVAL-PRINT-LOOP NIL ...)
(SYS::..RUNTIME-OPERATION "applyn" :UNKNOWN-ARGS)
(TPL:DO-COMMAND "zoom" :FROM-READ-EVAL-PRINT-LOOP ...)
Good news if the migs.dat error is fixed.
For the new error:
04-Mar-2021 18:14:25 Fatal error: end of requested sequence 1178 exceeds length of cached chromosome 502 Evaluation stack:
This message is saying that a gene with a sequence of size 1178 is in a chromosome with a sequence of size 502. And this is linked to the name of the locus of some of the records of the genbank file. At some point in the file, the locus IDs are all the same ('lcl'). Pathway Tools use this as an ID but as there is multiple records with the same ID they are overwrote by the information of the next record.
For example, this record with a gene sequence of 1178:
LOCUS lcl 1178 bp DNA UNK 17-FEB-2021
DEFINITION cellular organisms genome.
ACCESSION lcl|NZ_ACFY01000003.1_cds_3836
VERSION lcl|NZ_ACFY01000003.1_cds_3836
KEYWORDS cellular organisms.
SOURCE .
ORGANISM cellular organisms
.
FEATURES Location/Qualifiers
source 1..1178
/scaffold="lcl|NZ_ACFY01000003.1_cds_3836"
/db_xref="taxon:131567"
gene 2..1178
/locus_tag="lcl_0001"
CDS 2..1178
/locus_tag="lcl_0001"
ORIGIN
1 atgaaagaat tagagttaaa atacggatgt aatccaaacc agaaaccatc cagaatttat
61 atggaaaacg gcgagcttcc aatcaaagtg ttaaacggaa aaccgggtta tatcaacttt
121 ttagatgcat ttaacggctg gcagctggta agtgaattaa agaaagcaac cggacttccg
181 gcagcaacct cttttaaaca tgtatctccg gctggagcag cagttggact tccattgtct
241 gaggtagagc gcaaaatcta ttgggtagac gatatggacg tagagttcac accgcttgca
301 aacgcctata tcagagccag aggcgcggac agaatgtctt ctttcggcga tttcatttcc
361 ttatccgatg tctgtgacaa agagacagca cttgtaatca aacgagaggt gtctgatggt
421 gtgatcgcac cgggctatac agatgaggca cttgagattt taaaggcaaa gaaaaaggaa
481 actacaatgt gatcgagatc gatccgaatt atgtgccggc accaatcgaa cataaagaag
541 tgtttggaat cacatttgaa cagggaagaa atgagcttgt gatcgatgag catttctttg
601 acaacatcgt gacagagaat aaggaaatcc cggattctgc gaaaatggat cttgcaattt
661 ccatgattac attaaaatac acacagtcca attccgtatg ctatgtcaaa ggtggacagg
721 caatcggtat cggtgccgga cagcagtcca gaattcactg taccagactg gccggctcta
781 aggcagataa ctggtggtta cgccagtctc cacaggtgct cggacttcag tttttagata
841 agattggacg tgcagacaga gataatgcga ttgatctcta catcggtgag gactatatgg
901 atgtgcttgc agatggcgca tgggagaata tttttaaagt aaaaccggaa gtgtttacaa
961 gggaggaaaa acgcgcatgg ttagataaaa acactgacgt tgcgcttggt tccgatgcat
1021 tcttcccatt tggagataat gtggaacgtg ctcataagag cggagtgaaa tacatcgccc
1081 agccaggtgg ttctatccgt gatgatcatg tcattgcaac ctgtaacaaa tacggcattg
1141 ccatggcatt caccggaatt cgcttattcc atcattaa
//
will be used with the chromosome sequence of the last record with the ID 'lcl':
LOCUS lcl 502 bp DNA UNK 17-FEB-2021
DEFINITION cellular organisms genome.
ACCESSION lcl|NZ_ACFY01000177.1_cds_30
VERSION lcl|NZ_ACFY01000177.1_cds_30
KEYWORDS cellular organisms.
SOURCE .
ORGANISM cellular organisms
.
FEATURES Location/Qualifiers
source 1..502
/scaffold="lcl|NZ_ACFY01000177.1_cds_30"
/db_xref="taxon:131567"
gene 2..502
/locus_tag="lcl_0001"
CDS 2..502
/locus_tag="lcl_0001"
ORIGIN
1 atgtttgaag taggagaata tattgtttat ggatgtaaag gtgtatgtca ggtagaagaa
61 atcacacata ttgatattcc tggcagtaat aaggacaggc tgtattatgt tttggctccg
121 ttagaggaca gaaacggcaa aatatatgcc ccgaccgata atgccaaggt ggcaatgaga
181 aaagtcatta ccagacagga ggcagagcag ctgatcgagg agatgccgca gatcgaagaa
241 ctgtgggtag tcaatgaaaa acagcgggaa ctgcaatata aagaggcgtt aaagacctgt
301 gatttcaggg catggatcag tattatcaag acattatatt tccgaaaaaa aagagcggat
361 cgctcagggc aaaaaaataa catctttgga tgaacgttat ttaaaggctg cagaaaatga
421 gctttatggc gaattgtcac tgatattggg gcttcctaag agtcagatgg aagatttcat
481 ttgcagaaag cttgaggcgt ag
//
because they have the same locus ID. That's why, the error message says that there is a gene sequence of 1178 with a chromosome sequence of 502.
To fix this, you need to rename these locus IDs to make them different.
And I think the same idea goes for the locus_tag of the gene/CDS, which are all "lcl_0001". It should be better if they were different.
Hi, I suppose this issue was solved. I'll close it, do not hesitate to reopen it if needed.
Hi,
I'm trying to run your pipeline but I had an error. Do you have any idea how can I solve this? I'm leaving here the input and outputs.Thank you very much m2m_test.zip ` ######### Running metabolic network reconstruction with Pathway Tools ######### Check and delete unfinished builds of Pathway Tools.