camel-ai / camel

🐫 CAMEL: Finding the Scaling Law of Agents. A multi-agent framework. https://www.camel-ai.org
https://www.camel-ai.org
Apache License 2.0
5.32k stars 647 forks source link

[Feature Request] Add `Phi-3.5-vision-instruct` VLM model #849

Open lightaime opened 3 weeks ago

lightaime commented 3 weeks ago

Required prerequisites

Motivation

Add Phi-3.5-vision-instruct VLM model

Solution

No response

Alternatives

No response

Additional context

No response

tom-doerr commented 2 weeks ago

Would be interested in helping with this but I'm not quite sure what the issue currently is. Phi-3.5-vision-instruct is supported by vLLM right? So would the goal be to add image support to Camel in general?

CaelumF commented 1 week ago

Working with vLLM or ollama is fine, potentially some work is needed on the chat formatting and multi modal support