Open fritzo opened 3 years ago
yeah i think this is inherent. changing coordinate systems changes densities. scaling is raising a density to a power. raising densities to a given power in different coordinate systems will have different effects.
still, in the prototypical case with a global latent variable and N iid likelihood terms if you reparam the latent variable and then do SVI + data subsampling then everything is still fine provided that the scaling happens after the reparameterization. you're in a different coordinate system so of course the (say) mean field approximation may work better or worse but it'll still be a valid inference procedure in both coordinate systems
Consider the model
and its reparametrized version
equivalent to
Now observe that
@poutine.scale
ing the first model changes the effective rate parameter ofx
, whereas scalingmodel2
has no effect on the prior.Am I missing something? Does this mean poutine.reparam is incompatible with subsampling? Can we fix it somehow?