Closed christopher-beckham closed 4 years ago
Hey @christopher-beckham, I think you are mixing up the KL[q(z|x) || p(z)] (which is usually written in losses for VAE), and \Sum_j KL[q(z_j) || p(z_j)] which is scaled by gamma
in the readme. The key thing to understand is that KL[q(z|x) || p(z)] can be decomposed in the three terms used in the readme (cf. https://arxiv.org/abs/1802.04942 and http://approximateinference.org/accepted/HoffmanJohnson2016.pdf ).
Feel free to reopen if I misunderstood what you were saying.
Ah I see now, that makes sense. Thanks!
Hi,
Looking at this part of the readme, this doesn't seem right:
The standard VAE is simply
gamma=1
with noalpha
orbeta
. For Beta-VAE it is simplygamma > 0
with again noalpha
orbeta
. Did I miss something?Thanks.