Hi,
Is it going to minimize the cross-entropy between (latent) sparse GP posterior and (rearranged) VAE approximate posterior
during the optimization? Because gauss_cross_entropy function seems to return the negative CE of the two posteriors. Another related question is the benefits for adding this ce_term in optimization, could we simply remove this term in the loss and will the results be very different?
Best wishes
John
Hi, Is it going to minimize the cross-entropy between (latent) sparse GP posterior and (rearranged) VAE approximate posterior during the optimization? Because gauss_cross_entropy function seems to return the negative CE of the two posteriors. Another related question is the benefits for adding this ce_term in optimization, could we simply remove this term in the loss and will the results be very different? Best wishes John