labring / FastGPT

FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, letting you easily develop and deploy complex question-answering systems without the need for extensive setup or configuration.
https://fastgpt.in
Other
16.76k stars 4.47k forks source link

【建议】PDF、PPT文档解析优化 #2056

Open Essence9999 opened 1 month ago

Essence9999 commented 1 month ago

例行检查

功能描述 PDF:可参考有道https://www.modelscope.cn/models/netease-youdao/QAnything-pdf-parser PPT暂无合适解析方案

Dr-xiaoming commented 1 month ago

https://github.com/CosmosShadow/gptpdf

ws02589111 commented 1 month ago

这个项目: https://github.com/opendatalab/MinerU 以及其依赖的项目也可以做参考: https://github.com/opendatalab/PDF-Extract-Kit