stschiff / msmc2

GNU General Public License v3.0
Question about phasing and the cross-coalescence rate #50

Open luisamarins opened 1 year ago

luisamarins commented 1 year ago

Hello, I have a dataset of three genomes from individuals from three different populations. I read in the tutorial that for a single diploid genome as input (i.e., two haplotypes), no phasing is necessary, so I didn't worry about phasing for my "within population" runs.

I now want to estimate the coalescence rate across populations, so that I can date the split between them. I have a few questions:

Your help is greatly appreciated :)

stschiff commented 1 year ago

Yes, in this case phasing is needed. I don't fully get your list of indices. You can certainly prepare your input file for three populations, but the estimation should focus on a pair of populations between which you want to compute the cross-coalescence rate.
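To make the pairing concrete, here is a minimal Python sketch of how the cross-population haplotype pairs for one pair of populations could be enumerated. It assumes one diploid individual per population and that haplotypes are numbered consecutively in the input file (individual 0 gets 0,1; individual 1 gets 2,3; and so on). The helper names `haplotype_indices` and `cross_pairs` are made up for illustration, and the comma-separated `i-j` format is my understanding of what the `-I` option expects for cross-population pairs, so please check it against the msmc2 guide before using it.

```python
from itertools import product


def haplotype_indices(individual, ploidy=2):
    """Haplotype indices of one individual, assuming consecutive numbering (hypothetical helper)."""
    start = individual * ploidy
    return list(range(start, start + ploidy))


def cross_pairs(haps_a, haps_b):
    """Every pairing of one haplotype from each population, as 'i-j' strings (hypothetical helper)."""
    return [f"{i}-{j}" for i, j in product(haps_a, haps_b)]


pop1 = haplotype_indices(0)  # [0, 1]
pop2 = haplotype_indices(1)  # [2, 3]
pop3 = haplotype_indices(2)  # [4, 5]

# One comma-separated pair list per population pair of interest.
print(",".join(cross_pairs(pop1, pop2)))  # 0-2,0-3,1-2,1-3
print(",".join(cross_pairs(pop1, pop3)))  # 0-4,0-5,1-4,1-5
```

Each population pair gets its own list, which matches the idea of estimating the cross-coalescence rate for one pair at a time rather than for all three populations together.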

So if you have indices 0,1 for pop1; 2,3 for pop2 and 4,5 for pop3, then you could run