
Drag and Drop RAG

Overview


This project is a Retrieval-Augmented Generation (RAG) pipeline that lets users upload data (CSV, JSON, PDF, or DOCX files), store it in a Chroma vector store, and interact with it through a chatbot powered by Gemini (gemini-1.5-pro). The chatbot retrieves relevant data from the uploaded files, augments user queries with it, and generates responses using the LLM.
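Before embedding, uploaded documents are typically split into smaller chunks. The exact chunking strategy in app.py is not documented here; a minimal sketch of one common approach, fixed-size character chunks with overlap, looks like this:

```python
def chunk_text(text, chunk_size=200, overlap=50):
    """Split text into overlapping character chunks for embedding.

    Overlap keeps sentences that straddle a chunk boundary retrievable
    from at least one chunk. Sizes here are illustrative, not the
    values used by the app.
    """
    if chunk_size <= overlap:
        raise ValueError("chunk_size must exceed overlap")
    chunks = []
    start = 0
    while start < len(text):
        chunks.append(text[start:start + chunk_size])
        start += chunk_size - overlap
    return chunks

doc = "word " * 100          # 500-character toy document
pieces = chunk_text(doc)
print(len(pieces), len(pieces[0]))  # → 4 200
```

Each chunk is then embedded and stored as one entry in the vector store.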

Features

  1. Upload CSV, JSON, PDF, or DOCX files – Supports multiple file types and allows users to select columns for vector search.
  2. Store and retrieve vector embeddings using Chroma.
  3. Interactive chatbot using the Gemini API to generate responses based on user queries and stored data.
  4. Customizable retrieval – Choose which columns the LLM should draw on when answering queries.

Running the Application

  1. Clone the repository to your local machine:

    git clone https://github.com/bangoc123/drop-rag.git
    cd drop-rag
  2. Install the required Python packages:

    pip install -r requirements.txt
  3. Run the Streamlit app:

    streamlit run app.py
  4. Access the application at http://localhost:8501 in your browser.

Steps to Use:

  1. Upload Data: Upload a CSV, JSON, PDF, or DOCX file. Select the column to be indexed for vector search.
  2. Save Data: The file is saved in the Chroma vector store with vector embeddings generated by the all-MiniLM-L6-v2 or keepitreal/vietnamese-sbert model.
  3. Set up the LLM: Enter your Gemini API key to configure the chatbot for generating responses. You can get a key from Google AI Studio.
  4. Chat: Start interacting with the chatbot, which retrieves and augments responses using the data from the uploaded file.
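The "augment" step in the chat flow amounts to assembling a prompt that puts the retrieved chunks in front of the user's question before the LLM is called. A sketch of that assembly (`build_prompt` and the prompt wording are illustrative, not the app's actual template):

```python
def build_prompt(question, retrieved_chunks):
    """Assemble an augmented prompt: retrieved context first, question last."""
    context = "\n".join(f"- {c}" for c in retrieved_chunks)
    return (
        "Answer using only the context below.\n"
        f"Context:\n{context}\n"
        f"Question: {question}"
    )

chunks = ["Orders ship within 3 days.", "Returns accepted for 30 days."]
prompt = build_prompt("How fast is shipping?", chunks)
print(prompt.splitlines()[1])  # → Context:

# The assembled prompt is then sent to Gemini, along these lines:
#   import google.generativeai as genai
#   genai.configure(api_key=GEMINI_API_KEY)
#   model = genai.GenerativeModel("gemini-1.5-pro")
#   answer = model.generate_content(prompt).text
```

Grounding the model in retrieved context this way is what keeps answers tied to the uploaded file rather than the model's general knowledge.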
