latitude-dev / latitude-llm

Latitude is the open-source prompt engineering platform to build, evaluate, and refine your prompts with AI

https://latitude.so

GNU Lesser General Public License v3.0

891 stars, 57 forks

Vision Image Support? #352

Open hongkongkiwi opened 1 month ago

hongkongkiwi commented 1 month ago

Does the model support vision/images?

This is a key feature for me and I'd love to be able to test some prompts with images.

geclos commented 1 month ago

Hey @hongkongkiwi, we don't support images at this point (we started this project just a few weeks ago), but it's definitely something we wanna do in the future.

csansoon commented 1 month ago

Hello! We do support image recognition for compatible models like gpt-4o, but for now the source image must be hosted somewhere else. Adding an image tag inside a user message will let the LLM analyse the contents of the image:

---
provider: Latitude
model: gpt-4o
---

Describe what you can see in this image:

<user>
  <image>{{image_url}}</image>
</user>
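
Since the image has to be hosted elsewhere, it can help to check the value passed as `image_url` before running the prompt. The following is only a minimal sketch and not part of Latitude: the helper names `is_hosted_url` and `fill_image_url` are hypothetical, and the plain string substitution merely stands in for whatever Latitude does when it fills in `{{image_url}}`.

```python
from urllib.parse import urlparse

# Prompt body from the comment above, with the {{image_url}} placeholder.
PROMPT_TEMPLATE = """\
Describe what you can see in this image:

<user>
  <image>{{image_url}}</image>
</user>
"""


def is_hosted_url(url: str) -> bool:
    """Return True when url is an absolute http(s) URL with a host."""
    parts = urlparse(url)
    return parts.scheme in ("http", "https") and bool(parts.netloc)


def fill_image_url(template: str, image_url: str) -> str:
    """Substitute {{image_url}} after checking the image is remotely hosted.

    Hypothetical helper: a naive stand-in for real template rendering.
    """
    if not is_hosted_url(image_url):
        raise ValueError(
            f"image must be hosted at an http(s) URL, got: {image_url!r}"
        )
    return template.replace("{{image_url}}", image_url)


if __name__ == "__main__":
    # example.com is a placeholder host, not a real image location.
    print(fill_image_url(PROMPT_TEMPLATE, "https://example.com/cat.png"))
```

With this sketch, a local path such as `./cat.png` or a `file://` URL raises `ValueError` instead of being sent to the model.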