lightaime opened this issue 1 year ago
I am interested in developing a multimodal platform for comparing different single- and multi-agent setups. A first step might be enabling multimodal input for LLMs, for example via a MultimodalPrompt. I think this feature needs some discussion. Are there any guidelines on what I should do or investigate before opening a pull request?
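As a starting point for discussion, here is a minimal sketch of what a MultimodalPrompt could look like. The class name is taken from the proposal above, but every field and method here is a hypothetical assumption, not an existing API:

```python
from dataclasses import dataclass, field
from typing import Dict, List

# Hypothetical sketch: field names ("image_paths", "audio_paths") and the
# to_content_parts() method are assumptions for discussion, not CAMEL APIs.
@dataclass
class MultimodalPrompt:
    text: str
    image_paths: List[str] = field(default_factory=list)
    audio_paths: List[str] = field(default_factory=list)

    def to_content_parts(self) -> List[Dict[str, str]]:
        """Flatten the prompt into a list of typed content parts that a
        multimodal backend could consume."""
        parts: List[Dict[str, str]] = [{"type": "text", "text": self.text}]
        parts += [{"type": "image", "path": p} for p in self.image_paths]
        parts += [{"type": "audio", "path": p} for p in self.audio_paths]
        return parts

prompt = MultimodalPrompt(text="Describe this scene.", image_paths=["scene.png"])
print(len(prompt.to_content_parts()))  # one text part + one image part
```

A structure like this would let text-only agents keep using the `text` field unchanged, while multimodal backends consume the full list of content parts.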
Required prerequisites
Motivation
The current framework uses two LLMs as the backend for role-playing, which limits it to text-only tasks. We would like to introduce multimodal agents that can solve tasks involving other modalities, such as vision and audio.
Solution
No response
Alternatives
No response
Additional context
No response