sjczheng / EpiDISH

This package contains a reference-based function to infer the proportions of a priori known cell subtypes present in a sample representing a mixture of such cell-types. Inference proceeds via one of 3 methods (Robust Partial Correlations-RPC, Cibersort (CBS), Constrained Projection (CP)), as determined by user.
42 stars 11 forks source link

Does EpiDISH works for WGBS data? #6

Closed Niinleslie closed 4 years ago

Niinleslie commented 4 years ago

Hi, Can I apply EpiDISH to esitmate immune cell fractions of WGBS data from liver tumor tissues? If so, how should I prepare the input and ref data? (I can only find the example for beta value matrix).

Thanks!

sjczheng commented 4 years ago

EpiDISH can be used with WGBS data. You need to make a reference matrix with row names as the same row names of the data matrix. For WGBS, you can do chr.position as row names. Or you can make your own style. The values of this reference matrix can be computed using purified WGBS or array data. If you use array data, just remember to make sure that the genome version you are using is consistent with the array annotation when you do position liftover.

KristenKrolick commented 1 year ago

@sjczheng

Hello, thank you so much for making this tool! I am making my own reference consisting of different WGBS blood cell types and the reference data I am able to download from GEO happens to have the methylation value conventions of 1 to 1000 (they said they did meth calling with wgbstools), while my dataset has the methylation value convention as used by bismark of 0 to 1. Is this going to be a problem, or will the software somehow correct for this anyways?

Thank you!

sjczheng commented 1 year ago

Hi, You need the methylated counts and unmethylated counts to calculate beta values. The software won't correct this.

KristenKrolick commented 1 year ago

@sjczheng The reference datasets do not have methylated and unmethylated counts, they are already beta values then?

Furthermore, I am loading my data from bismark.cov files into R using methrix to make the input matrix into Epidish. I believe methrix must be making the 'beta' values or ratio of methylation calls for me, because my input are only methylation values 0-1. Do you know if this would be fine?

Thank you for your input! Kristen

sunyu1010 commented 6 months ago

@sjczheng Hi, thank you so much for your answer! So if I want to apply EpiDISH to WGBS data, I should calculate the methylation proportion by dividing methylated counts by all counts. In other words, the methylation proportion is equivalent to beta value of arrays. Is that true? I've just started to learn WGBS, this may be an elementary question, sorry to bother you!