langgenius / dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
https://dify.ai

Agent mode has no image upload feature; image upload/input support is needed #10616

Closed Modas-Li closed 1 day ago

Modas-Li commented 1 day ago

Self Checks

1. Is this request related to a challenge you're experiencing? Tell me about your story.

There is no image upload feature in agent mode. Agent mode needs to support image upload/input.

2. Additional context or comments

My Dify version is 0.11.0.

3. Can you help us with this feature?

crazywoola commented 1 day ago

LLMs marked with the eye icon can read images. Please switch to a model that has this capability.

image

https://github.com/langgenius/dify/issues/10574