Denser Retriever
[![Python Version](https://img.shields.io/pypi/pyversions/denser-retriever.svg)](https://pypi.org/project/denser-retriever/)
[![Dependencies Status](https://img.shields.io/badge/dependencies-up%20to%20date-brightgreen.svg)](https://github.com/denser-org/denser-retriever/pulls?utf8=%E2%9C%93&q=is%3Apr%20author%3Aapp%2Fdependabot)
[![Code style: ruff](https://img.shields.io/badge/code%20style-black-000000.svg)](https://github.com/astral-sh/ruff)
[![Security: bandit](https://img.shields.io/badge/security-bandit-green.svg)](https://github.com/PyCQA/bandit)
[![Pre-commit](https://img.shields.io/badge/pre--commit-enabled-brightgreen?logo=pre-commit&logoColor=white)](https://github.com/denser-org/denser-retriever/blob/main/.pre-commit-config.yaml)
[![Semantic Versions](https://img.shields.io/badge/%20%20%F0%9F%93%A6%F0%9F%9A%80-semantic--versions-e10079.svg)](https://github.com/denser-org/denser-retriever/releases)
[![License](https://img.shields.io/github/license/denser-org/denser-retriever)](https://github.com/denser-org/denser-retriever/blob/main/LICENSE)
![Coverage Report](assets/images/coverage.svg)
An enterprise-grade AI retriever designed to streamline AI integration into your applications, ensuring cutting-edge accuracy.
π Description
Denser Retriever combines multiple search technologies into a single platform. It utilizes gradient boosting (
xgboost) machine learning technique to combine:
-
Keyword-based searches that focus on fetching precisely what the query mentions.
-
Vector databases that are great for finding a wide range of potentially relevant answers.
-
Machine Learning rerankers that fine-tune the results to ensure the most relevant answers top the list.
-
Our experiments on MTEB datasets show that the combination of keyword search, vector search and a reranker via a xgboost model (denoted as ES+VS+RR_n) can significantly improve the vector search (VS) baseline.
- Check out Denser Retriever experiments using the Anthropic Contextual Retrieval dataset at here.
π Features
The initial release of Denser Retriever provides the following features.
- Supporting heterogeneous retrievers such as keyword search, vector search, and ML model reranking
- Leveraging xgboost ML technique to effectively combine heterogeneous retrievers
- State-of-the-art accuracy on MTEB Retrieval benchmarking
- Demonstrating how to use Denser retriever to power an end-to-end applications such as chatbot and semantic search
π¦ Installation
We recommend installing Python via Anaconda, as we have received feedback about issues with Numpy installation when using the installer from https://www.python.org/downloads/. We are working on providing a solution to this problem. To install Denser Retriever, you can run:
Pip
pip install git+https://github.com/denser-org/denser-retriever.git#main
Poetry
poetry add git+https://github.com/denser-org/denser-retriever.git#main
π Documentation
The official documentation is hosted on retriever.denser.ai.
Click here to get started.
π¨πΌβπ» Development
You can start developing Denser Retriever on your local machine.
See DEVELOPMENT.md for more details.
π‘ License
This project is licensed under the terms of the MIT
license.
See LICENSE for more details.
π Citation
@misc{denser-retriever,
author = {denser-org},
title = {An enterprise-grade AI retriever designed to streamline AI integration into your applications, ensuring cutting-edge accuracy.},
year = {2024},
publisher = {GitHub},
journal = {GitHub repository},
howpublished = {\url{https://github.com/denser-org/denser-retriever}}
}