cooler file preprocessing for Akita

I have some microC data from some cell lines that I'd like to implement in a similar way to the Akita manuscript. I was just wondering if you did any processing of the cooler files, beyond running distiller_nf and matrix balancing (iterative correction)

From the tutorial it doesn't look like it, but in the Akita manuscript:

To focus on locus-specific patterns and mitigate the impact of sparse sampling present in even the currently highest-resolution Hi-C maps, we adaptively coarse-grain, normalize for the distance-dependent decrease in contact frequency, take a natural log, clip to (−2,2), linearly interpolate missing bins and convolve with a small 2D Gaussian filter (sigma, 1 and width, 5). The first to third steps use cooltools functions

I'm guessing you used the adaptive_coarsegrain function in cooltools for the 1st step, but I'm uncertain how the distance-dependent normalisation, interpolation, or convolution were implemented. Were those for a specific case in the manuscript and not necessary?

Also I saw you'd rerun the analysis splitting the genome into multiple folds. Do you have any advice/code as to how to implement this?

Thanks Philip

calico / basenji

cooler file preprocessing for Akita #143