RobinQu / instinct.cpp

instinct.cpp provides ready to use alternatives to OpenAI Assistant API and built-in utilities for developing AI Agent applications (RAG, Chatbot, Code interpreter) powered by language models. Call it langchain.cpp if you like.
Apache License 2.0
38 stars 3 forks source link

Modular RAG implementations: reranker, multi-retrievers #16

Closed RobinQu closed 5 months ago

RobinQu commented 7 months ago

For better evaluation result in HF QA dataset.

RobinQu commented 7 months ago

more complex rag pipeline may invole agent frameworks #18

Adaptive RAG

Image

https://github.com/langchain-ai/langgraph/blob/70c1c996a4c9fe8df518bcd849b3c6453dd0d58b/examples/rag/langgraph_adaptive_rag.ipynb

RobinQu commented 6 months ago

Rerank, Colbert, ...

Related work

Methods

Opensourced Projects

image

image

image

Conlcusion

RobinQu commented 6 months ago

Timeline

RobinQu commented 6 months ago

OpenAI officials parameter for RAG: https://platform.openai.com/docs/assistants/tools/file-search/how-it-works

By default, the file_search tool uses the following settings: Chunk size: 800 tokens Chunk overlap: 400 tokens Embedding model: text-embedding-3-large at 256 dimensions Maximum number of chunks added to context: 20 (could be fewer)

Supported file formats: https://platform.openai.com/docs/assistants/tools/file-search/supported-files

Image