Azure-Samples / azure-search-openai-demo

A sample app for the Retrieval-Augmented Generation pattern running in Azure, using Azure AI Search for retrieval and Azure OpenAI large language models to power ChatGPT-style and Q&A experiences.
https://azure.microsoft.com/products/search
MIT License
5.57k stars 3.74k forks source link

If I use the GPT-4o model, is it capable of receiving input as an image? And if there is text in the image, could it be capable of extracting the text inside the picture? #1681

Open christ25442544 opened 3 weeks ago

christ25442544 commented 3 weeks ago

Please provide us with the following information:

As the topic said, If I use gpt-4o, could it be possible to read the input image from user and generate the response based on the user input?

This issue is for a: (mark with an x)

- [ ] bug report -> please search issues before submitting
- [x ] feature request
- [ ] documentation issue or request
- [ ] regression (a behavior that used to work and stopped in a new release)

Minimal steps to reproduce

None

Any log messages given by the failure

Expected/desired behavior

OS and Version?

Windows 7, 8 or 10. Linux (which distribution). macOS (Yosemite? El Capitan? Sierra?)

azd version?

run azd version and copy paste here.

Versions

Mention any other details that might be useful


Thanks! We'll be in touch soon.

joshuadmatthews commented 3 weeks ago

I would also like to see this functionality added.