Closed PubuduSaneth closed 4 years ago
Hi- Thanks for your question. The CF_EM tag comes from FACETS' function emcncf which returns, among other things, a dataframe (cncf) with columns:
cncf: dataframe consisting of the columns of segmentation output as well as cellular fraction (cf), total (tcn) and lesser (lcn) copy number of each segment and their em counterpart (with .em suffix)
this where the CF_EM tag comes from. Since I only reformat the output of facets, what applies to the facets package applies also to my wrapper, cnv_facets.R
If I correctly remember, I got the description of cf.em from the vignettes where it says (emphasis mine):
Once the observed total number is obtained we obtain the allele specific copy numbers m and p and the cellular fraction φ using the logOR data. The cellular fraction is associated with the aberrant genotype. For clonal copy number alterations, φ equals tumor purity. For subclonal events, φ will be lower than the overall sample purity.
TL;DR Your understanding is probably correct.
I'm going to close the issue just now but feel free to reopen.
Best Dario
Hi
After running some tests on simulated datasets it is apparent that the CF_EM value is not adjusted for purity, which fits with with the above info "For clonal copy number alterations, φ equals tumor purity."
Therefore I believe the CF_EM value does need to be adjusted for purity in order to get the "somatic cellular fraction".
Dear Dario
Thank you for the very useful FACETS interface. However, I would like to have a clarification concerning the CF_EM info tag.
According to your readme file,
However, according to R FACETS package,
Therefore, since R FACETS package corrects cellular fraction for the tumor purity,
Please let me know if I have misunderstood your readme file.