EBI-Metagenomics / ebi-metagenomics-cwl

This repository contains the CWL description of the EBI Metagenomics pipeline
21 stars 14 forks source link

toil with CWL on LSF status #57

Open mr-c opened 7 years ago

mr-c commented 7 years ago

Current working branch will the bulk of the above fixed merged: https://github.com/mr-c/toil/tree/issues/1666-fail-not-on-unsubmitted-jobs Latest Toil release has all of the above mentioned fixes merged

mr-c commented 7 years ago

Note: In cwltoil, sub-workflows must fully complete before any of their outputs are available for use by any other step/job. For example, the go_summary in the functional analysis (IPS) workflow isn't subject to further processing, but its production holds up the availability of the functional_annotations for futher processing by the parent workflow.

mr-c commented 7 years ago

To run the CWL conformance tests using cwltoil on LSF

virtualenv env
source env/bin/activate
pip install -U pip
pip install -U setuptools wheel
pip install .[cwl]
git clone https://github.com/common-workflow-language/common-workflow-language.git
cd common-workflow-language
pip install cwltest
TMP=$PWD ./run_test.sh RUNNER=toil-cwl-runner EXTRA="--batchSystem LSF --logDebug --logFile ${PWD}/log --disableCaching --user-space-docker-cmd=udocker" -j8"

(edited to use " double quote instead of single with EXTRA) (edited to set TMP to a path on the shared filesystem, needed for cwltest)

hmenager commented 7 years ago

Note for @mr-c : here's what I got from toil[cwl] running a workflow on a single machine, at some point I hear complaints about disk usage, although I never specified any requirements on that:

ripley 2017-06-08 18:50:26,180 Thread-82 WARNING toil.statsAndLogging: Got message from job at time 06-08-2017 18:50:26: Job used more disk than requested. Please reconsider modifying the user script to avoid the chance  of failure due to incorrectly requested resources. Job 'file:///home/hmenager/ReproHackathon/reprohackathon1/cwl/tools/fastq-dump.cwl' fastq-dump 8/A/job6HSIGE used 128.93% (2.6 GB [2768723968B] used, 2.0 GB [2147483648B] requested) at the end of its run.

The tool itself is defined there: https://github.com/IFB-ElixirFr/ReproHackathon/blob/cwl/reprohackathon1/cwl/tools/fastq-dump.cwl

mr-c commented 7 years ago

Priorities: