Open yja1 opened 4 months ago
how about add contrastive learning loss? example in Figure, vl looks like learning color and vr looks like learning shape. so , vl between color relative words as positive. others words coule as negative.
Same idea here, but I think the key is how to design the loss
how about add contrastive learning loss? example in Figure, vl looks like learning color and vr looks like learning shape. so , vl between color relative words as positive. others words coule as negative.