facebookresearch / suncet

Code to reproduce the results in the FAIR research papers "Semi-Supervised Learning of Visual Features by Non-Parametrically Predicting View Assignments with Support Samples" https://arxiv.org/abs/2104.13963 and "Supervision Accelerates Pre-training in Contrastive Semi-Supervised Learning of Visual Representations" https://arxiv.org/abs/2006.10803
MIT License

Negative loss #21

Closed chrishendra93 closed 2 years ago

chrishendra93 commented 2 years ago

Hi, I really like the idea of this paper: not only is it simple, it also leverages a small amount of labelled data instead of being completely unsupervised. I have been trying to incorporate the PAWS idea into my own project, which has a small number of images but with segmentation labels.

So far I have been having problems with the loss going negative, because the entropy of the average sharpened probability tends to reach its maximum easily. Am I right in saying that there is no theoretical guarantee that the mean sharpened-probability entropy term will always be smaller than the cross-entropy loss, and did you encounter this problem while training PAWS?
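To make what I mean concrete, here is a toy illustration (not the actual PAWS loss code, just the shape loss = cross-entropy - H(avg_probs) that I am assuming): even with confident, well-matched predictions, the entropy of the average prediction can dominate and push the total loss below zero.

```python
import torch

# Toy example of a PAWS-style objective: loss = CE - H(avg_probs).
# Three confident predictions that each match their target exactly.
probs = torch.tensor([[0.98, 0.01, 0.01],
                      [0.01, 0.98, 0.01],
                      [0.01, 0.01, 0.98]])
targets = probs  # perfectly matched -> cross-entropy is small

# Cross-entropy between targets and predictions (per sample, then averaged).
ce = -torch.mean(torch.sum(targets * torch.log(probs), dim=1))

# Average prediction is (close to) uniform, so its entropy is near log(3).
avg_probs = probs.mean(dim=0)
entropy = -torch.sum(avg_probs * torch.log(avg_probs))

loss = ce - entropy
print(ce.item(), entropy.item(), loss.item())  # loss comes out negative
```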

MidoAssran commented 2 years ago

Hi @chrishendra93, thanks for your interest.

Yes, exactly. That's totally normal and not a problem. The loss is still being minimized, regardless of its value.

If you really don't like seeing a negative loss, you can add a constant to the me-max regularizer; it won't change the gradient at all. Maximizing the entropy of the average prediction is equivalent to minimizing the KL divergence to the uniform distribution, so you can add `math.log(len(avg_probs))` to the me-max regularizer here (i.e., turning it into that KL divergence). This has no effect on training and gives you the same results, but ensures that the loss is always positive.
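For concreteness, a minimal sketch of that shift, assuming a me-max term of the form -H(avg_probs); the function and variable names here are illustrative, not the exact ones in this repo:

```python
import math
import torch

def memax_regularizer(avg_probs: torch.Tensor) -> torch.Tensor:
    """Me-max regularizer written as KL(avg_probs || uniform).

    -H(avg_probs) + log(K) equals the KL divergence from the average
    prediction to the uniform distribution over K classes, which is
    always >= 0, so the added constant shifts the loss value without
    changing any gradients.
    """
    neg_entropy = torch.sum(avg_probs * torch.log(avg_probs + 1e-12))
    return neg_entropy + math.log(len(avg_probs))
```

Since the cross-entropy term is itself non-negative, a total loss of the form `cross_entropy + memax_regularizer(avg_probs)` then stays non-negative as well, while the minimizer and the training dynamics are unchanged.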