tom-hc-park / MSc-RA-Bayesian-evidence-synthesis

Research Project at M.Sc. Statistics at University of British Columbia
4 stars 0 forks source link

Questions draft (I will edit and trim my questions better soon) #3

Open tom-hc-park opened 4 years ago

tom-hc-park commented 4 years ago
  1. prior predictive distribution vs prior distribution:

    distribution of y values vs distribution of parameters before data given

  2. posterior predictive distribution vs posterior distribution

    distribution of y values vs distribution of parameters after data given. PPD has wider range of samples since it accounts for uncertainty of parameters ?

  3. posterior predictive check compare the given data, and the new samples from PPD. are we using the data twice, once for obtaining the posterior distribution, twice for getting PPD?

  4. Mike's variational Bayes gives us trace and ppc while MCMC giving me an error: Not enough samples to build a trace. Is it an advantage of VB? being able to get samples in some not good enough situations? what does 'Not enough samples to build a trace.' mean? The 'samples' referring to the original data set? (I doubled the sample size to produce but got the same error).

  5. why do we call trace as trace (trivial)

  6. sometimes the process of sampling (Variational Bayes) stopped in the middle and lagged. Or sometimes it works.... I am confused. Is it a a. converging issue, not convergence -> infinite loop? b. any other reason? c. is it common thing to handle when we do Bayesian analysis ?

  7. Adding time variation, overdose probability p is increasing from Mike's plot while mine is not. I am wondering it's just my coding issue (maybe something I did wrong) or a difference can be made from MCMC and VB.

(link for questions problem)

paulgstf commented 4 years ago

First pass at responses.

  1. prior predictive distribution of a datapoint versus prior distribution of parameters.

  2. Ditto. posterior predictive for y_next | y_collected, versus posterior for parameters given y_collected.

  3. Yes, the posterior predictive checks advocated by Gelman et al. do receive some criticism for using the same data twice.

  4. Hmmmm, that error message is a bit cryptic.

  5. Not sure, think because we are "tracing" the MCMC output across iterations.