Open spoorgholi74 opened 4 years ago
Because VAE would be trained with a reconstruction loss which not only ensures input information are preserved but also creates a rich low dimensional latent space on which the discriminator would be trained better.
Hi, I was wondering why did you use a VAE to get the embedding instead of just the output of a middle layer from a CNN?