morrislab / phylowgs

Application for inferring subclonal composition and evolution from whole-genome sequencing data.
GNU General Public License v3.0
108 stars 55 forks source link

Questions about generating CNV Input file #141

Open Inceid opened 1 year ago

Inceid commented 1 year ago

Hello PhyloWGS Devs!

I'm currently trying to run PhyloWGS with copy number variation (CNV) data obtained from whole-exome sequencing. I used a different tool than Battenburg / TITAN to call CNVs and am trying to convert the CNV calls into a format similar to the provided cnv_data.txt.

However, I am having trouble understanding how to calculate the number of reference reads a covering a given CNV.

Additionally, I wanted to clarify my understanding of the example cnv_data.txt input file provided:

cnv a   d   ssms    physical_cnvs
c0  66023,50883,62757,36056,58777   126755,100469,121941,71263,115417   s2,1,2;s4,0,1   chrom=1,start=1234,end=5678,major_cn=2,minor_cn=1,cell_prev=0.8;chrom=X,start=15,end=10000,major_cn=2,minor_cn=0,cell_prev=0.8;chrom=22,start=123,end=456,major_cn=1,minor_cn=0,cell_prev=0.8

This example shows that SSMs s2 and s4 overlap with CNV c0.

Thanks!