Closed xpbowler closed 3 months ago
Uses an implementation of the Elo rating system: https://en.wikipedia.org/wiki/Elo_rating_system
GPT-4o
command-r
BM25
rank_zephyr
A battle is eligible for updating a leaderboard following the conditions stated below:
llm
retriever and/or reranker
retriever
reranker
MR: Elo Leaderboard feature
Uses an implementation of the Elo rating system: https://en.wikipedia.org/wiki/Elo_rating_system
We have 3 leaderboards:
GPT-4o
,command-r
)BM25
+rank_zephyr
)BM25
+rank_zephyr
+GPT-4o
)A battle is eligible for updating a leaderboard following the conditions stated below:
llm
retriever and/or reranker
llm
's are not equal and either theretriever
orreranker
's are not equal