Implement measure-valued derivatives

HEmile / storchastic

Stochastic Automatic Differentiation library for PyTorch.

GNU General Public License v3.0

180 stars 5 forks source link

Measure valued derivatives are an alternative to REINFORCE/score function. See https://arxiv.org/pdf/1906.10652.pdf for a clear explanation.

It has some problems when implementing it, though! Samples are taken using the positive and negative probability components. This means that blindly applying MC won't work for downstream estimation: It's not taken from the original distribution. We can easily fix this by importance sampling using the weighting function. Furthermore, to make it compatible with auto-diff, a solution could be: $\sum_i \thetai \bot(c{\theta_i}(f(x_1)-f(x_2))$, where $x_1 \sim p^+$ and $x_2\sim p^-$.

HEmile / storchastic

Implement measure-valued derivatives #78