joshspeagle / dynesty

Dynamic Nested Sampling package for computing Bayesian posteriors and evidences
https://dynesty.readthedocs.io/
MIT License

High dimensions log(z) bias #289

Closed by segasai 3 years ago

segasai commented 3 years ago

As seen in #285, there is a bias in logz at high dimension counts.

I've briefly checked, and the bias does not seem to be correlated with the number of live points (see below). That is kind of suspicious, since you'd expect any numerical/precision issues to shrink as you increase nlive. The bias is pretty substantial, about 50 dex. I'm wondering if this is a manifestation of some kind of bug somewhere, as opposed to sampling inaccuracies.

```
In [6]: rstate=np.random.RandomState(33);co=test_highdim.Config(rstate,32);X=test_highdim.do_gaussian(co,sample='rslice',bound='multi',nlive=500,rstate=rstate)

In [7]: co.logz_truth_gau
Out[7]: -243.22887870534663

In [8]: print(X)
(-195.13836523939548, 1.0181387163605187)

In [9]: rstate=np.random.RandomState(33);co=test_highdim.Config(rstate,32);X=test_highdim.do_gaussian(co,sample='rslice',bound='multi',nlive=1500,rstate=rstate)

In [10]: print(X)
(-197.90183809316025, 1.022940499442747)

In [11]: rstate=np.random.RandomState(33);co=test_highdim.Config(rstate,32);X=test_highdim.do_gaussian(co,sample='rslice',bound='multi',nlive=5000,rstate=rstate)

In [12]: print(X)
(-195.56476086603644, 1.0202571010884396)
```
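
For readers without the test suite handy, here is a rough standalone analogue of the setup above using dynesty's public API. This is a sketch only: the likelihood, prior width, and truth value below are illustrative and do not match the internals of test_highdim.

```python
import numpy as np
import dynesty

ndim = 32

def loglike(x):
    # independent standard-normal likelihood in each dimension
    return -0.5 * np.sum(x**2) - 0.5 * ndim * np.log(2 * np.pi)

def prior_transform(u):
    # uniform prior on [-10, 10] in each dimension
    return 20.0 * u - 10.0

# analytic evidence: the Gaussian mass outside [-10, 10] is negligible,
# so Z ~= (1/20)^ndim and logz_truth ~= -ndim * log(20)
logz_truth = -ndim * np.log(20.0)

sampler = dynesty.NestedSampler(loglike, prior_transform, ndim,
                                nlive=500, sample='rslice', bound='multi',
                                rstate=np.random.RandomState(33))
sampler.run_nested()
res = sampler.results
print(res.logz[-1], '+/-', res.logzerr[-1], '| truth:', logz_truth)
```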
segasai commented 3 years ago

Apparently, increasing the number of slices to 20 helps reduce the bias:

```
In [2]: rstate=np.random.RandomState(33);co=test_highdim.Config(rstate,32);X=test_highdim.do_gaussian(co,sample='rslice',bound='multi',nlive=500,rstate=rstate)

In [3]: X
Out[3]: (-232.03421701937455, 1.0847506259228943)

In [4]: co.logz
Out[4]: -243.22887870534663
```

So I guess the bias is caused by correlated samples. I'm a bit surprised, as I'd have thought slice sampling inside the ellipsoid should mix really well, but I guess that's no longer true in high-dimensional spaces.
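
For reference, in a direct dynesty call the number of slice updates per proposed point is controlled by the `slices` keyword, so the run above corresponds to something like the following (reusing the definitions from the sketch in the earlier comment):

```python
# as the sketch above, but with 20 slice updates per sample instead of the default 5
sampler = dynesty.NestedSampler(loglike, prior_transform, ndim,
                                nlive=500, sample='rslice', bound='multi',
                                slices=20, rstate=np.random.RandomState(33))
```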

joshspeagle commented 3 years ago

I think I've mentioned this in a thread a while back, but this makes sense to me: the autocorrelation time for slice sampling scales as O(d)-ish (I think the PolyChord paper actually states this somewhere explicitly), and correlated samples lead to an overestimate of the prior volume compression and therefore an overestimate of logZ (i.e. the positive bias you've been seeing). In 'slice', because you iterate through all the dimensions each time, this was implicitly taken into account. But in 'rslice', as in 'rwalk', you generally should set the number of steps to scale at least somewhat with dimensionality.

There's the additional issue of more live points giving better bounding distributions, and therefore better proposals in the whitened space, but I think you've found that, at least for rslice with very few slices, this is a subdominant effect compared to increasing the number of steps.
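
In practice that prescription just means scaling the chain-length arguments with dimension when calling dynesty, e.g. as below; the specific scalings shown are illustrative, not recommended values.

```python
# rslice: scale the number of slice updates with dimensionality
sampler = dynesty.NestedSampler(loglike, prior_transform, ndim,
                                sample='rslice', slices=max(5, ndim))

# the analogous knob for random walks is `walks`
sampler = dynesty.NestedSampler(loglike, prior_transform, ndim,
                                sample='rwalk', walks=25 + ndim)
```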

Tentatively closing this issue for now.

segasai commented 3 years ago

I think that calls for changing the defaults... (as mentioned in the other issue, in my opinion the defaults should ideally be the most sensible ones possible)

This is the test comparing four defaults:

- nslice = 5 (old)
- nslice = 5 + ndim (new)
- nslice = 5 + 0.25*ndim (new0.25)
- nslice = max(5, ndim) (newmax5)

[figure: logz errors for the four nslice defaults listed above]

I'm suggesting changing the default value of slices to max(5, ndim).

joshspeagle commented 3 years ago

This is fine with me. If you want to submit a PR to change the default options to be something like:
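
(A sketch of what such a default could look like, following the max(5, ndim) proposal above; the helper name `_get_default_steps` is hypothetical and this does not reproduce the exact snippet from the original comment:)

```python
# illustrative sketch: scale sampler step defaults with dimensionality
# (keyword names follow dynesty's API; the exact values are a proposal)
def _get_default_steps(sample, ndim):
    if sample in ('slice', 'rslice'):
        return {'slices': max(5, ndim)}
    elif sample == 'rwalk':
        return {'walks': max(25, ndim)}  # hypothetical analogous scaling
    return {}
```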

that should probably get most of the defaults in the right place.

segasai commented 3 years ago

Ok, I'll do that. (I used 5 as the default in my test because I thought that was the default, but it seems it is 3.)

joshspeagle commented 3 years ago

No, you're right -- it has been 5, but I think if we're adding ndim, we can reduce the base to 3 for most problems without any real issues.
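
(Presumably the new default then works out to something like the following; this is an inference from the exchange above rather than a quote of the merged change:)

```python
slices = 3 + ndim  # inferred post-change default: small base plus dimensional scaling
```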