A local RAG (Retrieval-Augmented Generation) application that lets you chat with your PDF documents using Ollama and LangChain. This project includes both a Jupyter notebook for experimentation and a Streamlit web interface for easy interaction.
### Install Ollama

Download and install Ollama from https://ollama.com, then pull the required models:

```bash
ollama pull llama2  # or your preferred model
ollama pull nomic-embed-text
```
### Clone the Repository

```bash
git clone https://github.com/tonykipkemboi/ollama_pdf_rag.git
cd ollama_pdf_rag
```
### Set Up the Environment

```bash
python -m venv venv
source venv/bin/activate  # On Windows: .\venv\Scripts\activate
pip install -r requirements.txt
```
### Run the Streamlit App

```bash
streamlit run streamlit_app.py
```

Then open your browser to http://localhost:8501.
*Streamlit interface showing the PDF viewer and chat functionality.*
### Run the Jupyter Notebook

```bash
jupyter notebook
```

Open `updated_rag_notebook.ipynb` to experiment with the code.
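At query time, the core retrieval step of the RAG pipeline boils down to finding the stored chunks whose embeddings are most similar to the query embedding. As a rough illustration of that idea (the real project delegates this to LangChain and a vector store; the tiny vectors below are stand-ins for actual `nomic-embed-text` embeddings):

```python
from math import sqrt

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = sqrt(sum(x * x for x in a))
    nb = sqrt(sum(y * y for y in b))
    return dot / (na * nb)

def retrieve(query_vec, chunk_vecs, chunks, k=1):
    """Return the k chunks whose embeddings are most similar to the query."""
    scored = sorted(zip(chunks, chunk_vecs),
                    key=lambda pair: cosine(query_vec, pair[1]),
                    reverse=True)
    return [text for text, _ in scored[:k]]

# Toy 3-dimensional "embeddings" standing in for real model output.
chunks = ["PDFs are parsed into pages.",
          "Ollama serves local models.",
          "Streamlit renders the UI."]
vecs = [[1.0, 0.1, 0.0], [0.0, 1.0, 0.1], [0.1, 0.0, 1.0]]
query = [0.0, 0.9, 0.2]  # pretend embedding of "what runs the model?"
print(retrieve(query, vecs, chunks))  # → ['Ollama serves local models.']
```

The retrieved chunks are then passed to the LLM as context alongside the user's question, which is what grounds the model's answers in your PDF.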
Contributions are welcome: feel free to open issues or submit pull requests.
### ONNX DLL Error

If you encounter this error:

```
DLL load failed while importing onnx_copy2py_export: a dynamic link library (DLL) initialization routine failed.
```

Try these solutions:

1. Install the Microsoft Visual C++ Redistributable.
2. If the error persists, reinstall ONNX Runtime manually:

   ```bash
   pip uninstall onnxruntime onnxruntime-gpu
   pip install onnxruntime
   ```
### CPU-Only Systems

If you're running on a CPU-only system, ensure you have the CPU version of ONNX Runtime:

```bash
pip uninstall onnxruntime-gpu  # remove the GPU version if installed
pip install onnxruntime       # install the CPU-only version
```
You may also need to adjust the text-splitting parameters in the code to prevent memory issues:

- Reduce `chunk_size` to 500-1000 if you experience memory problems
- Adjust `chunk_overlap` for better context preservation

> **Note:** The application will run slower on CPU-only systems, but it will still work effectively.
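To see how these two parameters interact, here is a toy sliding-window splitter. It is only an illustration (the project uses LangChain's splitter, which additionally tries to break on natural boundaries like paragraphs and sentences): each chunk is at most `chunk_size` characters, and consecutive chunks share `chunk_overlap` characters so that context spanning a boundary is not lost.

```python
def split_text(text, chunk_size=750, chunk_overlap=100):
    """Naive sliding-window splitter showing how chunk_size and chunk_overlap interact."""
    if chunk_overlap >= chunk_size:
        raise ValueError("chunk_overlap must be smaller than chunk_size")
    step = chunk_size - chunk_overlap  # how far the window advances each time
    return [text[i:i + chunk_size]
            for i in range(0, max(len(text) - chunk_overlap, 1), step)]

doc = "x" * 2000
chunks = split_text(doc, chunk_size=750, chunk_overlap=100)
print(len(chunks), [len(c) for c in chunks])  # → 3 [750, 750, 700]
```

Smaller chunks mean smaller prompts (and less memory pressure on CPU-only systems), while the overlap preserves continuity between adjacent chunks; increasing the overlap improves context at the cost of indexing more text overall.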
This project is open source and available under the MIT License.
Built with ❤️ by Tony Kipkemboi