cdoersch / vae_tutorial

Caffe code to accompany my Tutorial on Variational Autoencoders
MIT License

VAE loss seems to differ between paper and implementation #2

Closed by EmilienDupont 7 years ago

EmilienDupont commented 7 years ago

Hi, thanks for a great tutorial on VAEs! I have a quick question about the implementation. In the tutorial, the reconstruction loss is L2 (as I thought it should be):

[screenshot: the L2 reconstruction loss from the tutorial]

However, in the Caffe implementation there is what seems to be an additional cross-entropy reconstruction loss:

[screenshot: the cross-entropy loss layer from the Caffe implementation]

What is the purpose of this loss? Or am I missing something?

I realise cross-entropy loss is often better for less blurry images, but since we parametrize P(X|z) by a Gaussian with mean f(z), I thought the log-likelihood should be proportional to ||X - f(z)||^2.
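Spelling out that reasoning (writing sigma^2 for the fixed observation variance, a symbol I'm introducing here):

If P(X|z) = N(X | f(z), sigma^2 I), then

    log P(X|z) = -||X - f(z)||^2 / (2 sigma^2) + const,

so for a fixed sigma, maximizing the log-likelihood is equivalent to minimizing ||X - f(z)||^2.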

Thank you!

cdoersch commented 7 years ago

This is the definition of the EuclideanLoss layer:

layer {
  name: "loss"
  type: "EuclideanLoss"
  bottom: "decode1neuron"
  bottom: "flatdata"
  top: "l2_error"
  loss_weight: 0
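  # weight 0: this blob is computed for monitoring only and contributes nothing to the gradient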
  include {
    phase: TRAIN
  }
}

loss_weight is set to 0 because this layer isn't actually used in the training objective; I included it just so that you can track the Euclidean loss if you want to (or swap it in as the training loss). This example actually uses sigmoid cross entropy as the main loss. I discuss the reasons for this in section 4.1 of the tutorial.
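For reference, a sketch of what the corresponding training loss layer looks like in the prototxt (the layer name and the bottom blob name for the pre-sigmoid decoder output, here assumed to be "decode1", are guesses; check the repository's prototxt for the exact names):

layer {
  name: "cross_entropy_loss"
  type: "SigmoidCrossEntropyLoss"
  bottom: "decode1"          # pre-sigmoid decoder output (name assumed)
  bottom: "flatdata"         # flattened input image, same target as above
  top: "cross_entropy_loss"
  loss_weight: 1             # nonzero weight: this term drives training
  include {
    phase: TRAIN
  }
}

Note that SigmoidCrossEntropyLoss expects pre-sigmoid scores (it applies the sigmoid internally for numerical stability), which is why the EuclideanLoss monitor above reads the post-sigmoid blob "decode1neuron" instead.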

I've added a comment to explain this better. Thanks for pointing this out.