Can you please explain how hard negatives are incorporated in the loss function? I'm confused with the current implementation.
In the code here weights is just a tensor of zeros, because z3_weight is 0.0. So when you add it to cosine similarity here it doesn't change anything, is it? So the final loss function is somewhat different from the one in the paper under eq-n (5).
Can you please explain how hard negatives are incorporated in the loss function? I'm confused with the current implementation. In the code here weights is just a tensor of zeros, because z3_weight is 0.0. So when you add it to cosine similarity here it doesn't change anything, is it? So the final loss function is somewhat different from the one in the paper under eq-n (5).