janhq / jan

Jan is an open source alternative to ChatGPT that runs 100% offline on your computer. Multiple engine support (llama.cpp, TensorRT-LLM)
https://jan.ai/
GNU Affero General Public License v3.0
22.63k stars 1.3k forks source link

idea: Allow RAG with .txt files #3779

Open 4722794 opened 15 hours ago

4722794 commented 15 hours ago

Problem Statement

Hi, am I missing something?

The current tool only supports pdfs. Why not .txt files?

Feature Idea

Add support for pdfs and .txt files for RAG in tools.

4722794 commented 14 hours ago

Also, I did go through the documentation here: https://jan.ai/docs/tools/retrieval#enable-the-knowledge-retrieval

But I just don't get it. On my system, there is no provision to select a model. I was hoping to select some embedding model like text-embedding-3.

Screenshot 2024-10-10 at 8 20 34 PM

And where is it storing the chunks? Is it creating a vector database somewhere?