Open cfoster0 opened 3 years ago
As in the CLIP paper, we should clip the contrastive loss temperature so that the logits are not scaled by more than 100. Should be relatively easy.
As in the CLIP paper, we should clip the contrastive loss temperature so that the logits are not scaled by more than 100. Should be relatively easy.