Using OpenUI without Ollama and Llava?

wandb / openui

OpenUI let's you describe UI using your imagination, then see it rendered live.

https://openui.fly.dev

Apache License 2.0

18.84k stars 1.73k forks source link

Using OpenUI without Ollama and Llava? #137

Closed hukarere closed 4 months ago

hukarere commented 4 months ago

Hello,

After reading the README file, I have the following questions:

Is it possible to run OpenUI locally without Ollama?
Is Llava or any other model with image processing capabilities required for running OpenUI?
What OpenAI or Groq image processing model can be used instead of Ollama/Llava, and how?

Thanks in advance.

vanpelt commented 4 months ago

Hey @hukarere, by default OpenUI will use gpt-4o from OpenAI when an image is uploaded. If you don't upload an image and instead just chat with the model we can use any language model. Groq currently only supports language models so when choosing one of them you won't be able to upload an image.

vanpelt commented 4 months ago

Also, sorry to answer your questions directly:

Yes, just set an OPENAI_API_KEY and / or a GROQ_API_KEY
No, image models enable the ability to upload a screenshot but is not required. The tool works fine with only text.
Answered in the above comment. gpt4-o when using gpt-3.5-turbo otherwise the selected model from the "settings" menu if it supports images. For Groq, there's no image support.

hukarere commented 4 months ago

Hello @vanpelt,

Thanks for your explanations!

Perhaps it would make sense to add this detail about gpt4-o being used for image processing by default to README? Now it only mentions llava, so I had mistakenly concluded that llava is the only option...

hukarere commented 4 months ago

@vanpelt,

3. Answered in the above comment.  `gpt4-o` when using `gpt-3.5-turbo` otherwise the selected model from the "settings" menu if it supports images.  For Groq, there's no image support.

Another question: if I understood you correctly, gpt4-o is only used for uploaded images if gpt-3.5-turbo is used for text. What if I would like to use a groq model for text, but still use gpt4-o for uploaded images? Is that possible?