RadFriends performance - Githubissues

Hi @joshspeagle,

First of all, this is a really cool project! I like how it brings several new nested sampling developments together and is pure python.

You mention that RadFriends' performance is not so great because a lot of python calls are needed. I think you should not need a KD tree if you better take advantage of numpy vectorisation.

Just as an example, instead of

within = np.array([lalg.norm(ctrs[i] - x) <= self.radius for i in range(nctrs)], dtype='bool')
idxs = np.arange(nctrs)[within]

you can use:

idxs = np.where(lalg.norm(ctrs - x.reshape((1,-1)), axis=0) <= self.radius)

I have my own implementation of RadFriends in my nested_sampling repo (cneighbors.c, neighbors.py). There I use some C libraries for speed-up, which pays off a lot. I guess you do not want to do that in your pure-python library; Cython or numexpr may help though.

Also, in my experience RadFriends doesn't perform so well in some real-world applications because the parameters have very different extents. Using a standardized euclidean metric (normalise dimensions by std in each axis) instead helps a lot. SupFriends is pretty pointless in practice compared to RadFriends, I wouldn't mind if it was removed :) .

Cheers, Johannes

joshspeagle / dynesty

RadFriends performance #93