dozmorovlab / HiCcompare

Joint normalization of two Hi-C matrices, visualization and detection of differential chromatin interactions. See multiHiCcompare for the analysis of multiple Hi-C matrices
https://dozmorovlab.github.io/HiCcompare/
Other
19 stars 3 forks source link

How to select A.min values using filter_param function? #29

Closed ashishjain1988 closed 1 year ago

ashishjain1988 commented 1 year ago

Hi,

We have only one HiC sample per condition and we are trying to find out differential contacts. While using the default filtering we saw a large number of differential contacts that includes a lot of noise. We tried using the filter_params function to select the A.min values for filtering. I am not sure how to select the A.min value based on the plot from filter_params function as it seems pretty different from the one in the example data. Here is the plot that I got for chromosome 4. image

However, the plot for chromosome 22 something like this: image

We are interested in the analyzing differential contacts in chromosome 4 for our project.

Thanks, Ashish Jain

mdozmorov commented 1 year ago

The plot for chromosome 4 tells that A = 10 will give a reasonable balance of true and false positive rates. I'm not sure what's going on with chromosome 22 - it may have insufficient contacts due to sparsity and being the smallest chromosome. Please, check the actual data, visualize the chr22 matrix. In general, HiCcompare is relatively robust to the choices of A.

ashishjain1988 commented 1 year ago

Hi @mdozmorov, I have another question regarding the filtering process. It is fine if we remove the self ligated contacts from the data before the loess normalization or after carrying out the differential contact analysis?

mdozmorov commented 1 year ago

Any pipeline for Hi-C processing should already filter self-ligated and other artifacts. Not sure why the data you are referring to that has it.

ashishjain1988 commented 1 year ago

Actually, I am using the data generated using Micro-C library preparation. Most of the HiC data processing pipelines takes input the restriction enzyme digested HiC data. I tried to use them for normalization or for fold-change calculation but I didn't find any filtering function for Micro-C data.