QuivrHQ / quivr

Opiniated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: PGVector, Faiss. Any Files. Anyway you want.
https://core.quivr.com
Other
36.71k stars 3.59k forks source link

feat: Add SpaCy Processor for Enhanced NLP Support in Quivr #3468

Open Sahil-2101 opened 1 week ago

Sahil-2101 commented 1 week ago

Description

This pull request introduces the SpaCyProcessor class to handle various text file types (PDF, DOCX, TXT, and CSV) and perform NLP processing using spaCy. This addition includes:

Key Features:

Motivation: This feature adds support for spaCy NLP processing to enable richer text analysis and processing across various file types. The processor now efficiently handles different file formats, extracts meaningful text, and applies NLP, making it easier to work with structured document data in downstream applications.

Checklist before requesting a review

Please delete options that are not relevant.

Screenshots (if appropriate):

StanGirard commented 1 week ago

Very nice PR! We'll take a closer look :) @AmineDiro