djz233 / ClusterNS

Finding of ACL2023: Clustering-Aware Negative Sampling for Unsupervised Sentence Representation
12 stars 3 forks source link

Can you explain your loss function Lcl #5

Open hamrain opened 4 months ago

hamrain commented 4 months ago

I don't understand the meaning of the denominator in your formula Lcl. Or rather, why add Xj - as the denominator when there is already a clustering group with hard negative Xj+.

djz233 commented 1 month ago

glad to see your question. IT's the default loss function in contrastive learning, such that unnecssary to discard the normal negative. In our knowledge, most (even all) related works about hard negative still use normal negative such as SimCSE, MixCSE.