Any ideas on how to start on this? Calling viz.marginals on a probmods example with >100,000 samples (and four variables in the joint distribution) takes an order of magnitude more time than inference
Density estimation for continuous data is implemented naively. There are various tricks (e.g., FFT and tree-based computation, other stuff) to make this faster but it might not be worth the effort, given that we'll probably want kernel-based aggregators in core webppl anyway (probmods/webppl#369). Also, if I had to guess, the main bottleneck is probably the projection, not kde.
Any ideas on how to start on this? Calling
viz.marginals
on a probmods example with >100,000 samples (and four variables in the joint distribution) takes an order of magnitude more time than inference