Significant-Gravitas / AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
https://agpt.co
MIT License
163.66k stars 43.46k forks

Add support for Image, Video, and Audio input into Forge and AutoGPT #7152

Open ntindle opened 1 month ago

ntindle commented 1 month ago

Summary 💡

Adding support for image, video, and audio inputs to AutoGPT takes more than accepting them at the FastAPI server level. The inputs also have to be passed through the MultiProvider for LLMs, and each LLM's config has to say which of these features it supports so that this can be checked.
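To make the config-level check concrete, here is a minimal sketch of one way it could look. Everything in it is hypothetical: `Modality`, `ModelCapabilities`, `MODEL_CONFIGS`, `models_supporting`, and the model names are invented for illustration and are not the existing MultiProvider or Forge API.

```python
from __future__ import annotations

from dataclasses import dataclass
from enum import Enum


class Modality(str, Enum):
    """Input types an LLM might accept (hypothetical enum)."""
    TEXT = "text"
    IMAGE = "image"
    VIDEO = "video"
    AUDIO = "audio"


@dataclass(frozen=True)
class ModelCapabilities:
    """Hypothetical per-model config entry listing accepted input types."""
    name: str
    input_modalities: frozenset[Modality]


# Illustrative entries only: placeholder names, not real models or real capability data.
MODEL_CONFIGS = {
    "example-text-model": ModelCapabilities(
        "example-text-model", frozenset({Modality.TEXT})
    ),
    "example-vision-model": ModelCapabilities(
        "example-vision-model", frozenset({Modality.TEXT, Modality.IMAGE})
    ),
}


def models_supporting(required: set[Modality]) -> list[str]:
    """Return the names of configured models that accept every required modality."""
    return sorted(
        name
        for name, caps in MODEL_CONFIGS.items()
        if required <= caps.input_modalities  # subset test: all required types accepted
    )
```

With a lookup like this, a provider layer could reject or reroute a request that carries an image before it reaches a text-only model, for example by calling `models_supporting({Modality.IMAGE})`.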

Examples 🌈

No response

Motivation 🔦

The future of agents is multimodal.

ntindle commented 2 weeks ago

From boosterbot.ai, before we set up the GitHub integration to auto-comment the responses