replikation / poreCov

SARS-CoV-2 workflow for nanopore sequence data
https://case-group.github.io/
GNU General Public License v3.0

Process retry in slurm profile #263

Closed: MarieLataretu closed this issue 4 months ago

MarieLataretu commented 7 months ago

Hi there,

currently, we ignore errors in the slurm profile:

https://github.com/replikation/poreCov/blob/0ff536fe01c2c2410e1b666c74a3bd43bd8dc287/nextflow.config#L131

I'd like to suggest changing this to:

process.errorStrategy = 'retry'
process.maxRetries = 1

Sometimes we see a process fail because of HPC/SLURM hiccups. Usually, the exit status is 1 and the process succeeds on the first resume run.
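For illustration, here is a rough sketch of how the proposed settings might sit inside the slurm profile. The surrounding layout (a `profiles { slurm { ... } }` block and the `executor = 'slurm'` line) is an assumption on my side and not copied from the actual nextflow.config; only `errorStrategy` and `maxRetries` come from the suggestion above.

```groovy
// Hypothetical excerpt of nextflow.config; the real profile likely has more settings.
profiles {
    slurm {
        process {
            // Assumed: the profile submits jobs through the SLURM executor.
            executor = 'slurm'

            // Proposed change: resubmit a failed task instead of ignoring the error.
            errorStrategy = 'retry'

            // Allow a single resubmission per task, as suggested above.
            maxRetries = 1
        }
    }
}
```

With this, a task that fails once due to a transient cluster problem would be resubmitted automatically instead of needing a manual resume run. A task that also fails on the retry would no longer be silently ignored, so real errors would surface.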

If you are fine with that, I'd make a PR 🙂

hoelzer commented 7 months ago

yes, good idea! +1