izmailovpavel / understandingbdl

Other
231 stars 38 forks source link

deep ensembles being simple ensembles? #26

Open psteinb opened 4 months ago

psteinb commented 4 months ago

Hi,

thanks for sharing your code with the paper. I was checking the deep-ensemble experiments (e.g. 1d-regression_deep_ensembles.ipynb). It feels like your implementation diverges from the original Deep Ensemble paper considerably. Any reason why that is?

Best, Peter

izmailovpavel commented 4 months ago

Hey @psteinb, thanks for checking out the code. In what way does it diverge from the deep ensembles paper?

psteinb commented 4 months ago

So, there are 3 distinct aspects many people overlook when implementing DeepEnsembles. The paper is very brief in these aspects and hence, I do understand why they are overlooked.

izmailovpavel commented 4 months ago

So this is just a difference in the underlying model that we apply the deep ensembles methodology to. You can use a homoscedastic noise model (single head) or a heteroscedastic model. In our case, we consider a homoscedastic fixed observation noise model, which also gives rise to the loss function that we use. The mean / variance of ensemble predictions is calculate for the purposes of visualization.

So this is a difference in the underlying model that we use in this particular notebook and not in the approximate inference methodology (deep ensembles).