YorkieDev / DocuMind

MIT License
14 stars 3 forks source link

DocuMind: Your Intelligent Document Assistant 🧠

DocuMind is an advanced document processing and question-answering system that leverages the power of Large Language Models (LLMs) and Retrieval-Augmented Generation (RAG) to provide intelligent responses based on your documents and web content. (Note: This has only been tested on MacOS, might not work on other operating systems - it's Open Source so feel free to download and modify it to get it to work on your end)

Features 🌟

Screenshots 📸

image image image

Requirements 📋

Setup 🚀

  1. Clone the repository:

    git clone https://github.com/yorkiedev/documind.git
    cd documind
  2. Install the required dependencies:

    pip install -r requirements.txt
  3. Set up LM Studio:

    • Ensure LM Studio is running and accessible at http://localhost:1234/v1
    • Configure your LM Studio with appropriate models and presets
  4. Run the Streamlit app:

    streamlit run ragcrawler.py

Usage 📚

  1. Document Upload:

    • Use the sidebar to upload your documents (TXT, DOCX, PDF formats supported).
    • The system will process and index the documents automatically.
  2. Web Crawling:

    • Enter a URL in the sidebar and specify the maximum number of pages to crawl.
    • Click "Crawl and Add to RAG" to add the web content to your knowledge base.
  3. Chatting with DocuMind:

    • Type your questions in the chat input at the bottom of the page.
    • DocuMind will provide answers based on the content of your documents and crawled web pages.
  4. Viewing Responses:

    • Responses may include formatted text, LaTeX equations, and code snippets.
    • LaTeX equations are rendered inline for easy reading.
    • Code snippets are displayed with syntax highlighting.
  5. Managing Documents:

    • Use the "Clear All Documents" button in the sidebar to reset your knowledge base.

Customization 🛠️

Contributing 🤝

Contributions to DocuMind are welcome! Please feel free to submit pull requests, create issues or spread the word.

License 📄

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments 👏


Happy document processing and chatting with DocuMind! If you have any questions or run into issues, please open an issue on GitHub. Enjoy! 🎉