hierarchical KL loss - Githubissues

Hi! Very Impressive work, thanks for sharing! I have a question regarding the hierarchical KL loss. As in the original paper, the hierarchical kl loss is stated as:

∑_L [KL(q(zl | x, z(l-1)) || p(zl | z(l-1)))]

,between encoder and decoder.

I am wording why did you model the KL loss between p(zl | x, z(l-1)) and p(zl | z(l-1)), which both are from decoder? mu, log_var = self.condition_z[i](decoder_out).chunk(2, dim=1) delta_mu, delta_log_var = self.condition_xz[i](torch.cat([xs[i], decoder_out], dim=1)).chunk(2, dim=1) kl_losses.append(kl_2(delta_mu, delta_log_var, mu, log_var))

Please let me know if there are any misunderstandings. Thanks a lot in advance!:)

GlassyWing / nvae

hierarchical KL loss #17