Google Docs support - Githubissues

llmware-ai / llmware

Unified framework for building enterprise RAG pipelines with small, specialized models

Apache License 2.0

5.78k stars 1.43k forks

LLMWare provides extensive built-in parsing for Microsoft document types (PPTX, DOCX, and XLSX), but it does not currently parse or ingest Google Docs, Slides, and Sheets, and it has no connector to Google Drive repositories for storing and accessing documents.

It would be great to have an integrated capability that supports parsing, text chunking, and ingestion of Google document types and repositories. The implementation could take several forms: a de novo parser/text chunker in Python or C/C++ or, more likely, an interface to an existing Google document parser, along with the supporting code to integrate it seamlessly into LLMWare.
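One possible shape for the "interface to an existing parser" option, offered only as a rough sketch: export each Google file to its Office equivalent through the Drive API and let the existing PPTX/DOCX/XLSX parsers handle the result. The helper names `export_target` and `export_to_folder` are hypothetical and not part of LLMWare. The `service` argument is assumed to be a Drive v3 client (for example, one built with `googleapiclient.discovery.build("drive", "v3", ...)`) whose `files().export_media(...).execute()` call returns the exported bytes. Authentication, paging through a Drive folder, and export size limits are left out.

```python
from pathlib import Path

# Google Workspace MIME type -> (Office export MIME type, file extension).
EXPORT_TARGETS = {
    "application/vnd.google-apps.document": (
        "application/vnd.openxmlformats-officedocument.wordprocessingml.document",
        ".docx",
    ),
    "application/vnd.google-apps.spreadsheet": (
        "application/vnd.openxmlformats-officedocument.spreadsheetml.sheet",
        ".xlsx",
    ),
    "application/vnd.google-apps.presentation": (
        "application/vnd.openxmlformats-officedocument.presentationml.presentation",
        ".pptx",
    ),
}


def export_target(google_mime):
    """Return (export MIME type, extension) for a Google file type, or None."""
    return EXPORT_TARGETS.get(google_mime)


def export_to_folder(service, file_id, name, google_mime, out_dir):
    """Export one Google Docs/Sheets/Slides file into out_dir as an Office file.

    Hypothetical helper. `service` is assumed to behave like a Drive v3 client.
    Returns the written path, or None if the file is not a Google-native type
    that this sketch knows how to export.
    """
    target = export_target(google_mime)
    if target is None:
        return None
    export_mime, ext = target

    # Assumption: execute() on an export_media request returns raw bytes.
    data = service.files().export_media(
        fileId=file_id, mimeType=export_mime
    ).execute()

    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)

    # Replace characters that are unsafe in file names; fall back to the ID.
    safe = "".join(c if c.isalnum() or c in " ._-" else "_" for c in name).strip()
    path = out_dir / f"{safe or file_id}{ext}"
    path.write_bytes(data)
    return path
```

The output folder could then be handed to the existing ingestion path (for instance, a library's `add_files` call pointed at that folder), so no new parser is needed for a first version. A native parser would still be worth considering later, since an export round trip may lose Google-specific structure such as comments or suggestions.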


Google Docs support #1022