liesel-devs / liesel

A probabilistic programming framework
https://liesel-project.org
MIT License

Evaluate what parts of the Bayesian Workflow to cover #56

Closed. jobrachem closed this issue 1 year ago

jobrachem commented 1 year ago

Arose from

Bayesian Workflow paper: https://arxiv.org/abs/2011.01808

GianmarcoCallegher commented 1 year ago

Can we close this issue?

GianmarcoCallegher commented 1 year ago

On 07.06, we will discuss this issue again

GianmarcoCallegher commented 1 year ago

The evaluation of the Bayesian Workflow took place on 15.06. We went through the list of aspects that could become features of Liesel, commented on them, and gave our opinions on their priority for the next months/semesters. Here they are:

What aspects of the Bayesian workflow could become features of Liesel?

  1. Prior and posterior predictive simulation (Section 2.4 and 6.1)
    • There are some theoretical concerns, but the functions will undoubtedly be useful in many situations
    • Dr. Hannes has done previous work on this, which will be added to main sooner or later
  2. Pathfinder, better initial values (Section 3.1)
    • A VI approach to find initial values; alternatively, some other optimization algorithm (+ jitter) could be implemented
    • The consensus is that some kind of optimization algorithm for Liesel models would be helpful and could be based on optax or jaxopt
  3. Faster, approximate inference algorithms to speed up model building (e.g. VI, Section 3.3)
    • Closely related to the implementation of a simple optimization algorithm, but the long-term goal could be something like a Goose equivalent for VI
    • The long-term goal is probably not realistic for the four of us within the next couple of months/semesters
    • Sebastian's master's thesis is related, so let's wait for his results
  4. Run MCMC until $\hat{R} < 1 + \varepsilon$ (Section 3.2)
    • Could be implemented as a callback in Goose that checks "some" criterion, decides whether to stop the sampling, and communicates the reason
    • No one depends on this critically, but it would be really nice to have and could be a real time-saver/convenience
  5. Early stopping/failure criteria in Goose (Section 3.4)
    • Could be implemented like (4), but here the checks are model-specific or kernel-specific rather than chain-specific
  6. Stacking to reweight poorly mixing chains (Section 5.5)
    • Addressing the question: How can we learn the most from a given MCMC fit even though the chains/mixing are not perfect?
    • Should be relatively easy, maybe a student could work on it in the Statistical Practical or the Advanced Bayes seminar
  7. Automatic marginalization of discrete parameters (Section 5.8)
    • How often do we encounter discrete parameters? At least sometimes: Spike & Slab, Tensor Product Interactions, Species Occupancy
    • What is the cost of the alternatives, i.e. manual marginalization, and how often is automatic marginalization even possible?
    • Not very high up on our priorities
  8. Cross-validation (Section 6.2)
    • Is this a task for Liesel?
    • The design could be based on something like the DataLoader from PyTorch
    • Could be implemented as a wrapper for the DataLoader in a Liesel node/variable
    • Gianmarco has a related implementation that he uses for VI
  9. Working with multiple models, model stacking and averaging (Section 8.2)
    • None of us has ever worked with model averaging etc., so it's probably not a priority for us
    • But philosophically it seems nice
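To make point (1) a bit more concrete: here is a minimal prior predictive simulation for a toy normal model in plain NumPy. The model and function name are made up for illustration; a Liesel version would presumably draw the samples by traversing the model graph instead.

```python
import numpy as np

def prior_predictive(rng, n_draws, n_obs):
    """Draw datasets from the prior predictive distribution of a toy
    model: beta ~ N(0, 10), sigma ~ Exp(1), y_i ~ N(beta, sigma).
    (Illustrative stand-in, not the Liesel API.)"""
    beta = rng.normal(0.0, 10.0, size=n_draws)    # prior draws for the mean
    sigma = rng.exponential(1.0, size=n_draws)    # prior draws for the scale
    # one simulated dataset of n_obs observations per prior draw
    y = rng.normal(beta[:, None], sigma[:, None], size=(n_draws, n_obs))
    return y

rng = np.random.default_rng(0)
y_sim = prior_predictive(rng, n_draws=1000, n_obs=50)  # shape (1000, 50)
```

Posterior predictive simulation would look the same, except that `beta` and `sigma` come from MCMC samples rather than the prior.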
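For point (2), an actual implementation would likely build on optax or jaxopt as discussed above; the sketch below is just a hand-rolled gradient-descent stand-in on a toy target, to show the shape of an "optimize, then start MCMC there" helper.

```python
import numpy as np

def find_initial_values(grad, x0, lr=0.05, steps=500):
    """Plain gradient descent on the negative log density to find a
    reasonable MCMC starting point. A real implementation would use
    an optax/jaxopt optimizer (possibly plus jitter across chains)."""
    x = np.asarray(x0, dtype=float)
    for _ in range(steps):
        x = x - lr * grad(x)  # step downhill on the negative log density
    return x

# Toy target: standard normal, so the mode (and MAP) is at 0
grad_neg_log_prob = lambda x: x
x_init = find_initial_values(grad_neg_log_prob, x0=np.array([5.0, -3.0]))
```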
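And a rough sketch of the stopping criterion from point (4): a split-R-hat computation (per Gelman et al.) plus a check against the 1 + ε threshold. A Goose callback would wrap something like this; the function names here are invented for illustration.

```python
import numpy as np

def split_rhat(chains):
    """Split-R-hat for one parameter; chains has shape (n_chains, n_samples).
    Each chain is split in half so poor within-chain mixing is also detected."""
    n_chains, n_samples = chains.shape
    half = n_samples // 2
    splits = chains[:, : 2 * half].reshape(2 * n_chains, half)
    chain_means = splits.mean(axis=1)
    w = splits.var(axis=1, ddof=1).mean()   # within-chain variance
    b = half * chain_means.var(ddof=1)      # between-chain variance
    var_plus = (half - 1) / half * w + b / half
    return np.sqrt(var_plus / w)

def converged(chains, eps=0.01):
    """Stopping rule: keep sampling until R-hat < 1 + eps."""
    return split_rhat(chains) < 1 + eps

rng = np.random.default_rng(1)
mixed = rng.normal(size=(4, 4000))          # well-mixed chains: R-hat near 1
stuck = mixed + np.arange(4)[:, None]       # chains stuck at different levels
```

On top of this, the callback would only need to decide when to check (e.g. every k iterations) and how to report why sampling stopped.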

Thank you @hriebl