Discriminating Q(z|x) and P(z) versus Q(z) and P(z)

tolstikhin / wae

Wasserstein Auto-Encoders

BSD 3-Clause "New" or "Revised" License

505 stars 90 forks source link

Hi,

I am little bit confused after reading your paper. Please correct me if I misunderstood. In your paper, you show difference between VAE and WAE in terms of distribution matching objective

VAE: matching Q(z|x) to P(z)
WAE: matching Q(z) directly to P(z)

However, I wonder why you implemented WAE that matches Q(z|x) to P(z) with GAN and MMD. (ex. Discriminator still discriminates z tilda from Q(z|x) and z from P(z)

In order to match Q(z) to P(z), don't you have to calculate distance between Q(z) and P(z), in which Q(z) is obtained by marginalizing Q(z|x) with P(x)? But you are averaging distance between Q(z|x) and P(z) with multiple x.

tolstikhin / wae

Discriminating Q(z|x) and P(z) versus Q(z) and P(z) #5