Open carla-hazelf opened 7 months ago
First off, this is unrelated to your issue, but if you're running it on a cluster, you'll need to set some additional options to tell nextflow to run the jobs on the cluster nodes instead of the head nodes. See here for more info: https://www.nextflow.io/docs/latest/executor.html
Other than that, you appear to be running the pipeline correctly. The only step that failed is the step to make a heatmap that you can open in juicebox. However, the error message says that there were 0 reads in the input, so my guess is that the scaffolding didn't work either. So first, take a look at the assembly output and the stats related to it. Does it look like the scaffolding actually worked (e.g., is the N50 bigger after scaffolding than before)? If not, how many reads did you start out with vs. how many got aligned? You can look at all the intermediate files by going into the work directory and then the first few characters of the directory for that step are in the nextflow output — for example, the alignment step's work directory should start with work/ca/a11554
, so you can look there for the bam files and any error messages that step generated (in .command.err
).
Hope this helps!
Hi I am also getting the same error. Were you able to resolve this issue?
Any help is appreciated.
Best Kritika
Hi, Sorry for delayed response- @esrice, thank you for your quick response at the time, it's really appreciated. @gargkritika personally, I was having issues with nextflow more generally, so I ended up doing it manually. I followed the description here; https://github.com/GenomicsAotearoa/High-quality-genomes/blob/main/Centrostephanus/Urchin_HiCScaffolding_V4.ipynb
So, mapping my reads to my assembly using bwa (-5SP is the option for Hi-C data). I then marked duplicates in the .sam file using samblaster. Converted it to BAM, and filtered out secondary alignments and unmapped reads; then further sorting by coordinates/read names for downstream Hi-C analyses. Then it was a matter of following the yahs protocol; https://github.com/c-zhou/yahs I'm no expert! But this worked for me. Hope this helps. If you're needing this nextflow pipeline, try following the suggestion above and see if it works for you-- I don't recall if I got around to trying it or not.
Hello,
Thank you for developing this pipeline.
I run the pipeline on a linux HPC system with the following input using -profile conda:
And I receive the following error;
EDIT; I am new to Hi-C, and I did not prepare this data myself; am I misunderstanding any preprocessing steps I need to do with the HiC illumina data, or am I not understanding the code? Thank you