ay-lab / dcHiC

dcHiC: Differential compartment analysis for Hi-C datasets
MIT License
57 stars 10 forks source link

How to deal with technical duplication #11

Closed biozzq closed 3 years ago

biozzq commented 3 years ago

Dear all,

Because of low sequencing depths for some samples in my study, I have constructed another library for these samples and sequenced them to a high coverage. When preparing the input files to dcHiC, I'm not sure I'm doing the right thing. For these samples, I simply merged the validPairs not the allValidPairs by using cat, and then converted the merged file to sparse matrix/bed files by using buildmatrix.

Sincerely. Zheng zhuqing

ay-lab commented 3 years ago

Hi,

You can treat each validpair file as an independent replicate. For that you can create count.matrix_1 from validpair1 and matrix_2 from validpair2 using buildmatrix program separately. If you provide the replicates then dcHiC can utilize the intra-replicate variation present in your dataset. Concatenation is the next best option. Let us know how the result looks like and makes sense to you.