Open wzhiyuan2016 opened 10 months ago
hi The smaller the kl divergence value, the more similar it is. So it should be to increase contribution, so add a negative sign before gamma. Am I right in understanding this way?
hi The smaller the kl divergence value, the more similar it is. So it should be to increase contribution, so add a negative sign before gamma. Am I right in understanding this way?