Closed aksbiome closed 1 year ago
Hey @aksbiome,
Thanks for reporting the buggy behavior of the maxbin Snakefile rule implementation. Could you specify if you used the maxbin
or maxbinCross
method?
As you can see, one of them uses crossmapping information for binning, while the other does not. Note that maxbinCross is the default. Could you also run the command tree -d
in your metaGEM folder to show me where/how the maxbin bins are incorrectly being stored? e.g.
$ tree -d
.
├── metaGEM
│ ├── colab
│ │ └── assemblies
│ │ ├── sample1
│ │ ├── sample2
│ │ └── sample3
│ ├── envs
│ ├── extra
│ └── scripts
└── models
By the way, note that the extractDnaBins
rule simply goes into the bin_reassembly output and copies out the MAGs into a new folder while reformatting the name:
It is actually not surprising that GTDBTk ran successfully, since it just runs on the metaWRAP bin reassembly output, so I imagine that metaWRAP was able to produce some output.
Best wishes, Francisco
Hi Francisco,
When I used metaGEM, everything works well till the metawrap step. However, when I used "bash metGEM.sh -t extractDnaBins" command in step 5 of GTDBK then got an error: " No such file or directory", and I couldn't complete this step. Then I skipped this step and started with foolowing step Run GTDB-tk for taxonomic classification: bash metaGEM.sh -t gtdbtk -j 2 -c 24 -m 80 -h 12, surprisingly it worked, I guess. So can you look into this issue. Please find attached screen shot of my issue.
Another issue which need to resolve in metagem pipeline is when we run maxbin2 command then bins generated was not properly saved in respective folder, for that we need to manually add bins to that folder then only next metawrap command works.
Thanks!