QuivrHQ / quivr

Open-source RAG Framework for building GenAI Second Brains 🧠 Build productivity assistant (RAG) ⚡️🤖 Chat with your docs (PDF, CSV, ...) & apps using Langchain, GPT 3.5 / 4 turbo, Private, Anthropic, VertexAI, Ollama, LLMs, Groq that you can share with users ! Efficient retrieval augmented generation framework
https://quivr.com
Other
36.18k stars 3.52k forks source link

[Feature]: it should handle large pdf files and epub files #2630

Closed dality17 closed 1 month ago

dality17 commented 4 months ago

The Feature

i just want to say im amazed how much it improved. And it really is awsome to see it can point out all the sources when im asking a guestion. This is so amazing project! all they need to do is merge memgpt (the capacity to improve itself by every chat) Surely then this project would be unstoppable.

but there is something i want to request. i want to request that it can handle epub files larger pdf files. Because handling larger files efficiently is crucial for many users, especially those dealing with extensive documents or datasets. Improving the AI's capability in this aspect could significantly enhance its usefulness in various scenarios.

Motivation, pitch

Granting the request to handle larger PDF files and eBugs is essential for several reasons:

Enhanced Productivity: With the ability to handle larger files, users can process and analyze extensive documents or datasets more efficiently. This capability streamlines workflows and reduces the time required to manage and work with large volumes of information.

Improved Versatility: By accommodating larger files, the AI becomes more versatile and applicable across a wider range of tasks and industries. It can support researchers, analysts, educators, and professionals in various fields who rely on the processing and analysis of substantial amounts of data.

Better User Experience: Enabling the AI to handle larger files enhances the overall user experience by providing a seamless and uninterrupted workflow. Users can work with their documents or datasets without encountering limitations related to file size, resulting in a smoother and more productive interaction with the AI.

github-actions[bot] commented 1 month ago

Thanks for your contributions, we'll be closing this issue as it has gone stale. Feel free to reopen if you'd like to continue the discussion.