Open JuHyung-Son opened 3 months ago
@crazywoola
I'm not sure how to add this feature. Since Layout Analysis is an API, I thought it should go in model_runtime, but according to the documentation, model_runtime only supports models that perform the following tasks:
- LLM: LLM text completion, dialogue, pre-computed tokens capability
- Text Embedding Model: Text Embedding, pre-computed tokens capability
- Rerank Model: Segment Rerank capability
- Speech-to-text Model: Speech to text capability
- Text-to-speech Model: Text to speech capability
- Moderation: Moderation capability
@JuHyung-Son
Hello, sorry for the late response.
I think you are looking for this, right?
And if you would like to contribute, you can fulfill something like below.
@JuHyung-Son
Hi, I just found that Upstage's Document Parse is far better than any other tool I have tried. Is there any plan for, or progress on, incorporating Document Parse into Dify?
opensourced
Where is the open source repo?
I can't find it in their GitHub account:
Self Checks
1. Is this request related to a challenge you're experiencing? Tell me about your story.
It is hard to index unstructured documents such as HTML and PDF files. If an unstructured document parsing API were available, it would be a very powerful tool for Dify.
Upstage's Layout Analysis is a world-leading unstructured document parsing API. You can check its performance here:
https://en.content.upstage.ai/blog/business/introducing-layout-analysis
https://developers.upstage.ai/docs/apis/layout-analysis
Also, this API is going to be open-sourced.
2. Additional context or comments
No response
3. Can you help us with this feature?