nextflow-io / nextflow

A DSL for data-driven computational pipelines
http://nextflow.io
Apache License 2.0
2.73k stars 624 forks source link

Some processes were skipped during resume #4406

Open fengqing-dong opened 1 year ago

fengqing-dong commented 1 year ago

Problem: the workflow run only can run correctly at the frist time ; it will be failed when with -stub or -resume. Some process will be skipped.

why and how to fix it? Can you give me some advice?

The detail informaton as flow:

bentsherman commented 1 year ago

Can you share the complete log file? And is your pipeline repo publicly available? If not, it would be good to see the source code for the processes that were skipped.

fengqing-dong commented 1 year ago

Due to confidentiality, I am so sorry for that the full process code cannot be provided. Here's the streamlined process.

bentsherman commented 11 months ago

Thank you for sharing what you could. From your pipeline code and log, the key issue is that QC:QC_PICARD was skipped, which caused the other processes to be skipped because they depend on QC.out.collect(). QC_PICARD depends on ALN which was not included, but clearly ALN was executed. So my guess is that something is wrong with the outputs of the ALN workflow. I would look there.

Another helpful debugging tool might be the nf-prov plugin. Why don't you use it with the dag format to generate a task DAG for the initial and resumed runs? Then you can compare them, and it might reveal exactly where the chain was broken.