WARNING 2019-03-03: The c3co package is under heavy development.
remotes::install_github("pneuvial/c3co", build_vignettes = TRUE)
This package implements a constraint dictionary learning algorithm to recover subclones across several DNA copy number profiles. The c3co model is described in the figure below. Two heterogeneous tumor samples (green and yellow circles) are composed of a collection of normal cells (gray discs) and two cancer subclones (red triangles and blue squares). One of the cancer subclones is present in both tumor samples.
The corresponding (noiseless) total copy number profiles are displayed in the figure below. They are given by a linear combination of the latent profiles. This Figure is adapted from Nowak et al. (Biostatistics, 2011).
The c3co
method aims at inferring the latent (blue, red and gray) copy-number profiles and the corresponding weights from the observed (yellow and green) copy-number profiles. Importantly, c3co
takes advantage of the information of parental copy numbers, which are available from allelic ratios (a.k.a. B allele fractions or BAF) as measured by SNP array data or whole exome (WXS) or whole genome sequencing (WGS) data.
See the package vignette