dariober / cnv_facets

Somatic copy variant caller (CNV) for next generation sequencing
Other
67 stars 15 forks source link

Clarification concerning CF_EM info tag #17

Closed PubuduSaneth closed 4 years ago

PubuduSaneth commented 4 years ago

Dear Dario

Thank you for the very useful FACETS interface. However, I would like to have a clarification concerning the CF_EM info tag.

According to your readme file,

However, according to R FACETS package,

Therefore, since R FACETS package corrects cellular fraction for the tumor purity,

Please let me know if I have misunderstood your readme file.

dariober commented 4 years ago

Hi- Thanks for your question. The CF_EM tag comes from FACETS' function emcncf which returns, among other things, a dataframe (cncf) with columns:

cncf: dataframe consisting of the columns of segmentation output as well as cellular fraction (cf), total (tcn) and lesser (lcn) copy number of each segment and their em counterpart (with .em suffix)

this where the CF_EM tag comes from. Since I only reformat the output of facets, what applies to the facets package applies also to my wrapper, cnv_facets.R

If I correctly remember, I got the description of cf.em from the vignettes where it says (emphasis mine):

Once the observed total number is obtained we obtain the allele specific copy numbers m and p and the cellular fraction φ using the logOR data. The cellular fraction is associated with the aberrant genotype. For clonal copy number alterations, φ equals tumor purity. For subclonal events, φ will be lower than the overall sample purity.

TL;DR Your understanding is probably correct.

I'm going to close the issue just now but feel free to reopen.

Best Dario

GeorgetteTanner commented 2 years ago

Hi

After running some tests on simulated datasets it is apparent that the CF_EM value is not adjusted for purity, which fits with with the above info "For clonal copy number alterations, φ equals tumor purity."

Therefore I believe the CF_EM value does need to be adjusted for purity in order to get the "somatic cellular fraction".