embeddings-benchmark / mteb

MTEB: Massive Text Embedding Benchmark
https://arxiv.org/abs/2210.07316
Apache License 2.0

[Question] why does RerankingEvaluator implementation use embeddings + cos_sim instead of using similarity score from model? #229

Open · chrjxj opened 6 months ago

chrjxj commented 6 months ago

As far as I understand, reranker models typically take a query + doc pair as input and directly output a score; these are the so-called "cross-encoders".


However, when I read the RerankingEvaluator implementation (link), it gets embeddings for the query and the docs and then computes cosine similarity.
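For concreteness, here is a minimal sketch of the two scoring paths using the sentence-transformers API (the model names are illustrative, not necessarily what MTEB uses):

```python
from sentence_transformers import SentenceTransformer, CrossEncoder
from sentence_transformers.util import cos_sim

query = "what is a cross-encoder?"
docs = [
    "A cross-encoder scores a (query, doc) pair jointly.",
    "A bi-encoder embeds query and doc separately.",
]

# Bi-encoder path (what RerankingEvaluator does by default):
# embed query and docs independently, then rank by cosine similarity.
bi = SentenceTransformer("all-MiniLM-L6-v2")
q_emb = bi.encode(query, convert_to_tensor=True)
d_emb = bi.encode(docs, convert_to_tensor=True)
bi_scores = cos_sim(q_emb, d_emb)[0]  # shape: (len(docs),)

# Cross-encoder path: the model sees (query, doc) together and
# outputs a relevance score directly; no embeddings are produced.
ce = CrossEncoder("cross-encoder/ms-marco-MiniLM-L-6-v2")
ce_scores = ce.predict([(query, d) for d in docs])
```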

I modified the code to use the similarity score directly from the model output. As expected, the evaluation results differ from (are higher than) those of the default RerankingEvaluator implementation.

My question: why does the RerankingEvaluator implementation use embeddings + cos_sim instead of using the similarity score from the model?

Muennighoff commented 6 months ago

In my mind, Reranking is just about reordering texts such that the order is more accurate - it doesn't constrain how you reorder them. Thus, you can also use Bi-Encoders / embedding models as Rerankers; Cross-Encoders are just more common as they tend to give better performance.
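To illustrate the point: once you have one score per (query, doc) pair, reranking is just sorting by that score, regardless of whether the score came from cosine similarity or from a cross-encoder's output head. A minimal sketch with made-up scores:

```python
import numpy as np

docs = ["doc A", "doc B", "doc C"]
# Scores could come from cosine similarity (bi-encoder) or from a
# cross-encoder; reranking only cares about the resulting order.
scores = np.array([0.12, 0.87, 0.45])  # illustrative values
order = np.argsort(-scores)            # highest score first
reranked = [docs[i] for i in order]
# -> ["doc B", "doc C", "doc A"]
```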

It is cool that you have implemented Cross-Encoder support! If you want, you can open a PR and we can merge it :) I can also add it to the leaderboard, though I will probably need to make a distinction between Bi- and Cross-Encoders (e.g. the Chinese Reranking leaderboard already has a Cross-Encoder in it, I think).