Open gacrestani opened 1 week ago
@nservant this looks like the issue you were telling me about GATK using async I/O. I see -Dsamjdk.use_async_io_write_samtools=true
in the command line, is that the flag you had to disable?
Indeed, I tested this parameter, but in my case, it was more a cluster I/O issue.
Increasing the boost.cleanupInterval
to 180 sec fixed the issue.
But I guess tuning both parameters could be useful
@gacrestani try adding this setting to your nextflow config first:
boost.cleanupInterval = '180s'
If that doesn't work then try disabling the flag I mentioned. But ideally the cleanup should be able to work with GATK's async I/O which will be more efficient. I think the default cleanup interval is just too short
EDIT: fixed quotes in config code
Thank you. I will add the flag, rerun everything, and let you know how it goes.
A ran just crashed due to the same error. Do you think that further increasing the time would help solve the problem?
[cqls-x86_64-tcsh crestang@shmoo nextflow]$ cat genomics_pipeline_trial70/genomics_pipeline_trial70.o100431
Started on: shmoo
Started at: Mon Sep 23 02:49:28 PM PDT 2024
N E X T F L O W ~ version 24.04.3
Launching `main.nf` [exotic_sinoussi] DSL2 - revision: 45e929d511
B U R K E L A B P I P E L I N E
===================================
[8a/d89358] Submitted process > BwaMem (1)
[20/1e8fd5] Submitted process > BwaMem (2)
[d6/8d2aa4] Submitted process > BwaMem (3)
[0c/fbeac6] Submitted process > BwaMem (5)
[f1/109c6b] Submitted process > BwaMem (4)
[a7/f46cc7] Submitted process > BwaMem (6)
[25/14158e] Submitted process > MergeSamFiles (1)
[49/20221b] Submitted process > MergeSamFiles (2)
[a6/b8f854] Submitted process > MarkDuplicates (1)
[b5/28c8b6] Submitted process > MarkDuplicates (2)
[2e/5a2a86] Submitted process > BaseRecalibrator (1)
[bb/acb235] Submitted process > BaseRecalibrator (2)
ERROR ~ Error executing process > 'BaseRecalibrator (2)'
Caused by:
Missing output file(s) `EB_rep04_gen20_duplicates_marked.bam` expected by process `BaseRecalibrator (2)`
Hello, thanks for the plugin!
I am having some issues with it, though. It is apparently deleting intermediate files marked for use in upcoming processes. When running my pipeline (which I created myself), it fails with the following output error:
My workflow is:
And the processes up to the crash are:
My
nextflow.config
file looks like this:If I set
cleanup = false
, the pipeline runs and completes without issues. I am running Nextflow version 24.04.3 on my university's HPC (which uses SLURM). I am likely doing something wrong! Can you please help me troubleshoot this? Thank you very much!