Hi, I noticed that in PaCo/GPaCo the logits in the numerator of the loss function are not masked, while the denominator is masked, since `exp_logits = torch.exp(logits) * logits_mask`. Shouldn't the logits in the numerator be masked as well? Also, is the learnable center used to predict the ground-truth label, so that the task becomes a supervised problem? Thanks 😢
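To make the part I am asking about concrete, here is a minimal toy reproduction of the masking step (my own sketch, not the actual PaCo code; the tensor names follow the SupCon-style implementation):

```python
import torch

# Toy setup: 4 samples with normalized features, pairwise similarities
# used as logits (a simplified stand-in for the real loss inputs).
torch.manual_seed(0)
batch = 4
features = torch.nn.functional.normalize(torch.randn(batch, 8), dim=1)
logits = features @ features.t()

# logits_mask zeros out each anchor's similarity with itself.
logits_mask = torch.ones(batch, batch) - torch.eye(batch)

# Denominator: the self-contrast terms are removed here ...
exp_logits = torch.exp(logits) * logits_mask

# ... but the numerator term `logits` below is used unmasked,
# which is what my question is about.
log_prob = logits - torch.log(exp_logits.sum(dim=1, keepdim=True))
```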
Sorry, one more question: in the paper's Remark 2, after applying parametric contrastive learning, why does the probability become α/(1+αK_y) and C become 1/(1+αK_y)? I don't know how to compute this. Thanks 😢
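To show where I am stuck, here is my attempt at the computation (assuming, as I read Remark 2, that the class center gets weight $1$ and each of the $K_y$ positive samples gets weight $\alpha$ in the loss; please correct me if this setup is wrong):

```latex
% Maximize the weighted log-likelihood of the positives under a
% probability constraint:
%   max   \log p_c + \alpha \sum_{i=1}^{K_y} \log p_i
%   s.t.  p_c + \sum_{i=1}^{K_y} p_i = 1.
% Setting the Lagrangian's derivatives to zero gives p proportional
% to its weight, with normalizer 1 + \alpha K_y:
\[
  p_c = \frac{1}{1 + \alpha K_y}, \qquad
  p_i = \frac{\alpha}{1 + \alpha K_y}.
\]
```

If this Lagrange-multiplier argument is the intended derivation, then the sample probability α/(1+αK_y) and the center term 1/(1+αK_y) would follow directly, but I am not sure this matches the paper's assumptions.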