deeptools / HiCExplorer

HiCExplorer is a powerful and easy to use set of tools to process, normalize and visualize Hi-C data.
https://hicexplorer.readthedocs.org
GNU General Public License v3.0

Use hicSumMatrices to implement a scatter-gather technique for cloud deployment #814

Open biofilos opened 2 years ago

biofilos commented 2 years ago

I am implementing a HiCExplorer pipeline in WDL via Cromwell for deployment on AWS. One of the easiest ways to reduce running time in pipelines is usually to split the FASTQ files into chunks (scatter), run the entire pipeline on each chunk, and then merge the results (gather). If I do this with the HiCExplorer pipeline, can I run hicSumMatrices across the matrices generated for each chunk and get a matrix equivalent to the one I would get by running the whole pipeline on one pair of big FASTQ files?
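To make the question concrete, here is a minimal toy sketch in plain Python of the property I am relying on. It does not call HiCExplorer at all: `build_contact_counts`, `sum_contact_counts` and `scatter` are hypothetical stand-ins for the matrix-building step, the summing step, and the FASTQ split, and the read pairs are random numbers. It shows that raw bin counts add up across chunks when every read pair is counted independently, and that this may stop holding once a step compares reads with each other (duplicate removal is used here as an assumed example of such a step).

```python
import random
from collections import Counter


def build_contact_counts(read_pairs, bin_size, drop_duplicates=False):
    """Hypothetical stand-in for matrix building: count pairs per (bin_i, bin_j)."""
    if drop_duplicates:
        # Duplicates can only be detected among the reads this call sees.
        read_pairs = set(read_pairs)
    counts = Counter()
    for pos1, pos2 in read_pairs:
        b1, b2 = pos1 // bin_size, pos2 // bin_size
        # Store contacts symmetrically as (smaller bin, larger bin).
        counts[(min(b1, b2), max(b1, b2))] += 1
    return counts


def sum_contact_counts(matrices):
    """Hypothetical stand-in for the gather step: element-wise sum of counts."""
    total = Counter()
    for matrix in matrices:
        total.update(matrix)  # Counter.update adds counts
    return total


def scatter(read_pairs, n_chunks):
    """Split the read pairs into n_chunks interleaved chunks."""
    return [read_pairs[i::n_chunks] for i in range(n_chunks)]


random.seed(0)
BIN_SIZE = 10_000
pairs = [(random.randrange(100_000), random.randrange(100_000))
         for _ in range(10_000)]

# Case 1: every pair is counted independently, so chunked counts add up.
whole = build_contact_counts(pairs, BIN_SIZE)
gathered = sum_contact_counts(
    build_contact_counts(chunk, BIN_SIZE) for chunk in scatter(pairs, 4)
)
print(whole == gathered)  # True

# Case 2: a duplicate pair whose two copies land in different chunks is
# removed in the single run but survives in the scatter-gather run.
pairs_with_dup = pairs + [pairs[1]]
whole_dedup = build_contact_counts(pairs_with_dup, BIN_SIZE, drop_duplicates=True)
gathered_dedup = sum_contact_counts(
    build_contact_counts(chunk, BIN_SIZE, drop_duplicates=True)
    for chunk in scatter(pairs_with_dup, 4)
)
print(sum(whole_dedup.values()) < sum(gathered_dedup.values()))  # True
```

So in this toy model the sum is exact only for raw (uncorrected) counts built with identical bins, and only if no step depends on reads from other chunks. Whether the actual pipeline meets those conditions is what I would like to confirm.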