camel-ai / camel

🐫 CAMEL: Finding the Scaling Law of Agents. The first and the best multi-agent framework. https://www.camel-ai.org
https://docs.camel-ai.org/
Apache License 2.0
5.58k stars 678 forks source link

[Feature Request] Enhance the Chat Message's compatibility with multimodal content (text, images, video) #806

Closed Appointat closed 2 months ago

Appointat commented 3 months ago

Required prerequisites

Motivation

Camel-AI currently utilizes OpenAIMessage for message passing, which supports text and image content. However, as open-source multi-modal large language models (InternLM-XComposer) continue to evolve, there is a growing need for a more versatile message structure.

This issue proposes the creation of a new message data structure that can accommodate:

, in order to enable compatibility with advanced multi-modal models.

Solution

No response

Alternatives

No response

Additional context

No response

Wendong-Fan commented 2 months ago

close this issue as finished in https://github.com/camel-ai/camel/pull/787