popgenmethods / smcpp

SMC++ infers population history from whole-genome sequence data.
GNU General Public License v3.0
149 stars 34 forks source link

Cannot detect my simulated bottleneck #250

Open JaneeshBansal opened 1 year ago

JaneeshBansal commented 1 year ago

Hi,

I am having trouble identifying a bottleneck that I simulated.

I simulated data using msprime with the following values ...

I first sampled a single diploid individual, however, the plot did not show a bottleneck. From the paper, I understood that 2 lineages are part of the PSMC and the n-2 are part of the SFS. But if I sampled a single individual then only the PSMC part of SMC++ is being ran?

I also ran my simulation through MSMC2 running PSMC' and the plot did show a bottleneck shape.

Additionally, I sampled 5 and 50 diploid individuals but the characteristic bottleneck shape was not appearing in SMC++.

Could I please receive some advice on this?

Thank you Janeesh

bn_psmc bn_smcpp

terhorst commented 11 months ago

It looks as SMC++ is fitting a single constant piece over the whole region 1e5~7e5 generations. Try specifying the timepoints manually using the --timepoints option as described in the README.