deepset-ai / haystack

:mag: AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
https://haystack.deepset.ai
Apache License 2.0
16.93k stars 1.85k forks source link

feat: TransformerSimilarityRanker add batching across Documents during inference #8344

Closed sjrl closed 2 weeks ago

sjrl commented 2 weeks ago

Related Issues

Proposed Changes:

We added batching during inference time to the TransformerSimilarityRanker to help prevent OOMs when ranking large amounts of Documents.

How did you test it?

Added an integration test and existing tests.

Notes for the reviewer

Checklist

coveralls commented 2 weeks ago

Pull Request Test Coverage Report for Build 10772945893

Details


Files with Coverage Reduction New Missed Lines %
components/rankers/transformers_similarity.py 10 89.52%
<!-- Total: 10 -->
Totals Coverage Status
Change from base Build 10768832162: -0.02%
Covered Lines: 7096
Relevant Lines: 7860

💛 - Coveralls