langgenius / dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
https://dify.ai
Other
51.29k stars 7.39k forks source link

Does RAG not support the epub file format? #3708

Closed cgoder closed 6 months ago

cgoder commented 6 months ago

Self Checks

Dify version

0.6.4

Cloud or Self Hosted

Self Hosted (Docker)

Steps to reproduce

Does RAG not support the epub file format? image

✔️ Expected Behavior

docke version support epub file, same with cloud version.

❌ Actual Behavior

dosubot[bot] commented 6 months ago

Hey there, @cgoder! 🌟 I'm Dosu, a bot here to help you with bugs, answer your questions, and guide you on becoming a contributor while we wait for a human maintainer to join us. Think of me as your first line of support and a fellow community member. Diving into your issue now, and I'll get back to you with a full answer shortly! 🚀

crazywoola commented 6 months ago

https://github.com/langgenius/dify/blob/ad65c891e7f38958095dce16584d641537993bd5/api/core/rag/extractor/extract_processor.py#L110

See here, if you config the if etl_type == 'Unstructured':, then it should be supported.