Azure-Samples / azure-search-openai-demo

A sample app for the Retrieval-Augmented Generation pattern running in Azure, using Azure AI Search for retrieval and Azure OpenAI large language models to power ChatGPT-style and Q&A experiences.
https://azure.microsoft.com/products/search
MIT License
5.59k stars 3.75k forks source link

When using image embeddings, some image embeddings may be skipped #1675

Open pamelafox opened 3 weeks ago

pamelafox commented 3 weeks ago

This is replicable with the sample data. When running prepdocs, you'll see that several pages aren't represented in the sections uploaded, in that there are no sections with corresponding sourcepage equal to that page number, and thus no sections with an imageEmbedding corresponding to that sourcepage. That means some answers may be lower quality, as they don't find the relevant matching image.

Possible approaches: