deepset-ai / haystack

AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
https://haystack.deepset.ai
Apache License 2.0
16.66k stars 1.83k forks

feat: support bi-encoder models in TransformerSimilarityRanker #8245

Open Amnah199 opened 3 weeks ago

Amnah199 commented 3 weeks ago

Is your feature request related to a problem? Please describe. The current implementation of TransformerSimilarityRanker only supports cross-encoder models, which limits the use of bi-encoder models like ColBERT v2.0.

Related discussion: Colbert as reranker

Describe the solution you'd like Update the TransformerSimilarityRanker or create a new component to support bi-encoder models like ColBERT.
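For readers less familiar with the distinction, here is a minimal sketch of how a bi-encoder ranker scores documents: the query and each document are embedded independently, then compared with a similarity function. This is not Haystack's or fastRAG's API. `toy_embed`, `cosine`, and `rank_bi_encoder` are hypothetical names, and the bag-of-words embedding is only a stand-in for a real bi-encoder model.

```python
import math
from collections import Counter


def toy_embed(text):
    """Hypothetical stand-in for a bi-encoder model: a bag-of-words count vector."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_bi_encoder(query, documents, top_k=None):
    """Embed query and documents separately, then sort documents by similarity."""
    query_vec = toy_embed(query)
    # Each document is encoded without seeing the query (unlike a cross-encoder,
    # which scores the query and document jointly in one forward pass).
    scored = [(cosine(query_vec, toy_embed(doc)), doc) for doc in documents]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:top_k]


docs = ["cats chase mice", "stock markets fell today", "mice fear cats"]
ranked = rank_bi_encoder("cats and mice", docs)
```

Because documents are embedded independently of the query, their vectors could be precomputed once and reused across queries, which is the usual practical argument for bi-encoders over cross-encoders.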

Describe alternatives you've considered Leave the current implementation as is.

Additional context Similar implementation: Llama ColbertRerank
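As a rough illustration of what a ColBERT-style reranker computes, the sketch below implements late-interaction (MaxSim) scoring: each query token vector is matched to its best-scoring document token vector, and the maxima are summed. The function name and the hand-written 2-d token vectors are hypothetical. A real reranker would obtain per-token embeddings from a ColBERT checkpoint.

```python
def maxsim_score(query_tokens, doc_tokens):
    """Late-interaction score: for each query token vector, take the best
    dot product against any document token vector, then sum the maxima."""
    if not doc_tokens:
        return 0.0
    total = 0.0
    for q in query_tokens:
        total += max(sum(qi * di for qi, di in zip(q, d)) for d in doc_tokens)
    return total


# Toy per-token embeddings (assumed values, for illustration only).
query = [[1.0, 0.0], [0.0, 1.0]]
doc_a = [[1.0, 0.0], [0.0, 1.0]]  # covers both query tokens
doc_b = [[1.0, 0.0], [1.0, 0.0]]  # covers only the first query token

score_a = maxsim_score(query, doc_a)
score_b = maxsim_score(query, doc_b)
```

Here `doc_a` scores higher than `doc_b` because every query token finds a matching document token.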

peteriz commented 3 weeks ago

fastRAG is an extension of Haystack and has a bi-encoder similarity ranker. You're invited to check it out here. It also supports ColBERT and PLAID.

@julian-risch We will be happy to contribute the bi-encoder ranker upstream to Haystack.