PoonLab / Kaphi

Kernel-embedded ABC-SMC for phylodynamic inference
GNU Affero General Public License v3.0
4 stars 2 forks source link

Validate ABC-SMC parameter estimation on BiSSE model #101

Open MathiasRenaud opened 7 years ago

MathiasRenaud commented 7 years ago

All the kinks should be worked out of BiSSE model now. It run with 10,100, and 1000 particles. With all 6 priors set as a gamma distribution(rate=2, shape=1) and all proposals set to normal distribution (mean=0, sd=0.1), all parameters were overestimated (priors need tuning), but it runs: rplot

ArtPoon commented 6 years ago

Work on this issue will also be dependent on resolving parallelization #26

ArtPoon commented 6 years ago

@helenhe96 to try re-running with different prior specifications. For a sanity check, try priors that are tightly centered around the actual values.

helenhe96 commented 6 years ago

The simulation took 4624 seconds (77 minutes) on one thread.

helenhe96 commented 6 years ago

If using priors centered around the true values (with normal distribution), the posteriors are good representations of the true values: bisse

ArtPoon commented 6 years ago

Okay! But I'm guessing that we've "locked in" the priors tightly around the true parameter values. The next step is to relax the variance in the priors, and deflect the initial parameter values away from the true values.

ArtPoon commented 6 years ago

When you run this analysis, can you please use multiple threads?

helenhe96 commented 6 years ago

When I use multiple threads on this my computer crashes... I have tried 10 threads and 5 threads.

gtng92 commented 6 years ago

Issue #131

ArtPoon commented 6 years ago

From dev meeting:

Next steps:

ArtPoon commented 6 years ago

Pending implementation of labelled tree kernels, see #133

ArtPoon commented 6 years ago

Before working through the proposed experiments above, better do some checks on responsiveness of kernel distances (and perhaps other tree distances) to varying BiSSE model parameters.

  1. Simulate trees where one parameter is varied and the other model parameters are fixed (will need to determine what are "informative" parameter settings here).
  2. Select one tree to be the "target" tree and calculate distances of other simulations to the target.
  3. Plot distances versus parameter value, hopefully we see a concave trend where distance reaches a minimum near the true parameter value.