Closed AryehMiller closed 7 months ago
This type of error is generally due to a bug in Cactus, not Toil. I'm having a lot of trouble reproducing it, though. Did it happen before you ran with --restart? Do you have the original logs from all your runs?
I did notice a separate bug which I fixed here: https://github.com/ComparativeGenomicsToolkit/cactus/pull/1371
The upshot of this one is that by only running the RED masking (and not lastz), the command should run about 100X faster. You can achieve this with the current release by adding --maskMode red.
Hi Glenn, thanks for your quick response! I just e-mailed you the full log file from this failed run. I believe the run did indeed work before I ran with --restart, but can't seem to find the log file for that run (it was with a different dataset).
Given the resolution in #1371, do you recommend beginning a new run, without --restart, and adding the --maskMode red flag? Thanks!
The log you sent is from a run with --restart. It's the log from the first run, before running again with --restart, that I'm after. The reason is that I can't reproduce the error locally, so I'm wondering (and this has happened before) if it is particular to running with --restart. I've tried aborting a run myself and restarting it, without much luck.
Anyway, yes, my suggestion is definitely to rerun from scratch using --maskMode red. This is what the default behaviour should be, if it weren't for bug #1371 (and it will be in the next release). This shouldn't take too long: a 3GB genome typically runs in under 2 hours on 1 core. Even if your input is already masked with other tools, this will often save you tons of time and memory in subsequent stages of the pipeline.
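For reference, a rerun from scratch could look roughly like the sketch below. It reuses the paths and flags from the command posted in this issue, drops --restart, and adds --maskMode red. The JOBSTORE value is a hypothetical fresh location, not something from the thread: Toil will not start a non-restart run on a job store that already exists, so either point at a new path or remove the old one first. The script only prints the command unless RUN=1 is set.

```shell
#!/usr/bin/env bash
# Sketch only: the preprocess command from this issue, without --restart and
# with --maskMode red added.
# JOBSTORE is an assumed, hypothetical fresh path; a non-restart Toil run
# needs a job store location that does not exist yet.
JOBSTORE="/scratch/am3329/cactus/anoles/0-red"

CMD=(
  cactus-preprocess "$JOBSTORE"
  /projects/f_geneva_1/aryehmiller/cactus/LizardSeqsFinal.txt
  /projects/f_geneva_1/aryehmiller/cactus/AliOut/LizardSeqsFinal.txt
  --inputNames AnoFre AnoChl AnoSem AnoBar AnoCar AnoCyb AnoDis AnoSag AnoAur AnoGra AnoVal
  --maxCores 40
  --maxMemory 300G
  --maskMode red
  --logFile /projects/f_geneva_1/aryehmiller/cactus/AliOut/logs/preprocess-AnoFre.log
)

# Dry run by default: print the command. Set RUN=1 to actually execute it.
if [ "${RUN:-0}" = "1" ]; then
  "${CMD[@]}"
else
  printf '%s\n' "${CMD[*]}"
fi
```

Keeping the first run's log file somewhere safe before any later --restart makes it easier to compare the two if the error shows up again.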
I can't seem to find the log from the first run without --restart, but I added --maskMode red in a run from scratch, and the preprocess step ran cleanly and finished in 35 mins! Thanks for helping to resolve this!
Hi there! I'm currently trying to align several genomes in Cactus using the step-by-step approach (generated from cactus-prepare), and finishing up the cactus-preprocess step. Here's my command:
cactus-preprocess /scratch/am3329/cactus/anoles/0 /projects/f_geneva_1/aryehmiller/cactus/LizardSeqsFinal.txt /projects/f_geneva_1/aryehmiller/cactus/AliOut/LizardSeqsFinal.txt --inputNames AnoFre AnoChl AnoSem AnoBar AnoCar AnoCyb AnoDis AnoSag AnoAur AnoGra AnoVal --maxCores 40 --maxMemory 300G --restart --logFile /projects/f_geneva_1/aryehmiller/cactus/AliOut/logs/preprocess-AnoFre.log
Everything ran fine for several days, but now I'm running into a persistent error in the SLURM log that looks like some sort of Toil DAG-related error. I've printed the Toil information for the jobStore below. Perhaps this is a bug with how Toil is marking and queueing jobs? Let me know if you have any thoughts on how to resolve this one, and many thanks!