Feature: allow to use embedding models for the sentence reranking task

aws-samples / aws-genai-llm-chatbot

A modular and comprehensive solution to deploy a Multi-LLM and Multi-RAG powered chatbot (Amazon Bedrock, Anthropic, HuggingFace, OpenAI, Meta, AI21, Cohere, Mistral) using AWS CDK on AWS

MIT No Attribution

1.11k stars 332 forks source link

Cohere's Rerank 3 is multilingual @massi-ang, supporting over 100 languages. It also has 4k context length, whereas cross-encoder/ms-marco-MiniLM-L-12-v2 has a max context length of 512.

This project already supports a wide range of external APIs as options for the text-text models, so why not also include the option for folks to include their Cohere API key and use Rerank 3?

I already did a quick comparison between Rerank 2 and marco MiniLM, and Rerank 2 ranked the results much better: with my query asking about image generation regulations, Rerank 2 put image-generated related responses at the top of the list, whereas marco's responses did not include anything about image generation.

I could not find an open-source model with as much language support as Cohere Rerank 3.

If I add support for Cohere Rerank 3, would that resolve this issue @massi-ang?

aws-samples / aws-genai-llm-chatbot

Feature: allow to use embedding models for the sentence reranking task #280