HelgeSverre / ollama-gui

A Web Interface for chatting with your local LLMs via the ollama API
https://ollama-gui.vercel.app/
MIT License
515 stars 84 forks source link

Add support for multimodal models #18

Open andreaferretti opened 7 months ago

andreaferretti commented 7 months ago

In the Ollama CLI it is possible to use multimodal images (such as Llava) by issuing a request like

What's in this image? /Users/jmorgan/Desktop/smile.png

Apparently, this does not work in the Ollama GUI: the language model interprets the text as is, and does not load the content of the image