im3sanger / dndscv

dN/dS methods to quantify selection in cancer and somatic evolution
GNU General Public License v3.0
211 stars 48 forks source link

Normalize to depth of coverage across mitochondrial genome target? #72

Closed tessleuthner closed 2 years ago

tessleuthner commented 2 years ago

Hello Inigo,

I am working a project where we have done targeted Duplex Sequencing of C. elegans mitochondrial DNA after a mutation accumulation line experiment. I'm working with Scott Kennedy, and he mentioned using dndscv to look at dN/dS. I have it working (I am happy to provide you the C. elegans mitochondrial reference genome RefCDS object).

However, something that Scott and I are concerned about is that I have somewhat non-uniform sequencing coverage across the genome. I have attached an example from one library. Though on average we have high depth (~5,000X/sample), you can see that it is somewhat variable. I attached a second file where I have plotted the SNVs (in Geneious Prime), and it appears that there are in fact a couple coding regions that have no mutations where coverage was very low.

Scott and I would be really interested to hear if you think there is a way to normalize dN/dS rates to the variable coverage. Please don't hesitate to reach me at tcl21@duke.edu, though I'm sure other folks might be interested in this issue too. Thanks! -Tess

Celegans example DupSeq coverage

N2_Geneious_graph.pdf

im3sanger commented 2 years ago

Hi Tess,

Although we are communicating by email, I thought I would provide an answer here for other users.

Briefly, there are two analyses to consider:

Best wishes, Inigo