UKPLab / sentence-transformers

State-of-the-Art Text Embeddings
https://www.sbert.net
Apache License 2.0

Asymmetric (two model) learning of pairs #3024

Open turian opened 2 days ago

turian commented 2 days ago

My understanding is that fine-tuning using sentence-transformers over pairs assumes that the pairs are symmetric. For example, using ContrastiveTensionLossInBatchNegatives.

This means that the same embed(·) function is applied to both anchor1 and anchor2, and that d(anchor1, anchor2) = d(anchor2, anchor1).

However, there are many use-cases where the order of the sentences is important. For example: anchor1 is a SUMMARY of anchor2.

It would be great to optionally allow fine-tuning to produce TWO fine-tuned models: one for embedding anchor1 and a second for embedding anchor2. This would make it easy to fine-tune on asymmetric tasks.

tomaarsen commented 1 day ago

Hello!

There are indeed symmetric losses in Sentence Transformers, but also a few asymmetric ones. For example MultipleNegativesRankingLoss, which is arguably the most common loss. This loss essentially trains:

"Given this anchor, the paired positive should be closer to it than any other text in the batch."

If you have asymmetric data like summaries, then you might be able to train:

"Given this summary, find the corresponding full text among the in-batch candidates."

or

"Given this full text, find the corresponding summary among the in-batch candidates."

depending on what you choose as your first column. The resulting model becomes adept at the asymmetric task (e.g. information retrieval).
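For concreteness, a minimal sketch of what that could look like with the v3 SentenceTransformerTrainer, assuming tiny in-memory (summary, full text) pairs as the two dataset columns (swap the columns to train the other direction); the base model and example texts are placeholders:

```python
from datasets import Dataset
from sentence_transformers import SentenceTransformer, SentenceTransformerTrainer
from sentence_transformers.losses import MultipleNegativesRankingLoss

# Hypothetical (summary, full text) pairs; replace with your real data.
# The first column is treated as the anchor, the second as the positive.
train_dataset = Dataset.from_dict({
    "anchor": ["Short summary A", "Short summary B"],
    "positive": ["Long full text A ...", "Long full text B ..."],
})

model = SentenceTransformer("microsoft/mpnet-base")  # any base model
loss = MultipleNegativesRankingLoss(model)

trainer = SentenceTransformerTrainer(
    model=model,
    train_dataset=train_dataset,
    loss=loss,
)
trainer.train()
```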


Some model authors use prefixes/prompts for each type of data, so that the model can distinguish e.g. a query of "what are pandas" from a document of "what are pandas". There are some docs on them here, but note that this only refers to inference. For training you have to manually prepend the prompts. A rough sketch of that pattern is below.
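The sketch uses made-up prefix strings and a hypothetical model path; the only requirement is that training and inference use the same prefixes:

```python
from sentence_transformers import SentenceTransformer

# Hypothetical prefix strings; any consistent choice works.
SUMMARY_PROMPT = "summary: "
DOCUMENT_PROMPT = "document: "

# During training, prepend the prefixes to the raw text yourself,
# e.g. with datasets.Dataset.map over the (anchor, positive) columns.

# At inference, recent sentence-transformers versions accept the prefix
# via the `prompt` argument of encode():
model = SentenceTransformer("path/to/your-finetuned-model")  # hypothetical path
summary_embeddings = model.encode(["a short summary"], prompt=SUMMARY_PROMPT)
document_embeddings = model.encode(["the full document text"], prompt=DOCUMENT_PROMPT)
```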

Sentence Transformers does not support dual model setups right now, and I don't think I'll be moving in that direction soon.