Open psteinb opened 4 months ago
Hey @psteinb, thanks for checking out the code. In what way does it diverge from the deep ensembles paper?
So, there are 3 distinct aspects many people overlook when implementing DeepEnsembles. The paper is very brief in these aspects and hence, I do understand why they are overlooked.
So this is just a difference in the underlying model that we apply the deep ensembles methodology to. You can use a homoscedastic noise model (single head) or a heteroscedastic model. In our case, we consider a homoscedastic fixed observation noise model, which also gives rise to the loss function that we use. The mean / variance of ensemble predictions is calculate for the purposes of visualization.
So this is a difference in the underlying model that we use in this particular notebook and not in the approximate inference methodology (deep ensembles).
Hi,
thanks for sharing your code with the paper. I was checking the deep-ensemble experiments (e.g. 1d-regression_deep_ensembles.ipynb). It feels like your implementation diverges from the original Deep Ensemble paper considerably. Any reason why that is?
Best, Peter