Closed bharathraja closed 11 months ago
GPT can't generate text anywhere near fast enough to react to a real-time environment, and if it could then we'd be broke from the token costs of generating detailed actions 60 times a second.
This isn't the worst idea; it could be done if the Gradio tools plug-in is fixed up a bit.
I have embeddings fever and it's exciting to see that we got a latent space for motions going, but multimodal, spatiotemporal autoencoding and decoding isn't cheap.
This should probably be renamed to something like "multi-modality"?
This issue has automatically been marked as stale because it has not had any activity in the last 50 days. You can unstale it by commenting or removing the label. Otherwise, this issue will be closed in 10 days.
This issue was closed automatically because it has been stale for 10 days with no activity.
Duplicates
Summary
Can Auto-GPT be expanded to take multimodal input such as images, audio, and touch, and to act through a humanoid robot body? Modularizing image object recognition, audio-to-text processing, and touch-based input tokenization into a text format would integrate all the senses. This could enable truly autonomous humanoid robots, and it could be tested initially in simulated environments such as OpenAI Gym.
There is literature that has expanded GPT's ability to human action sequences, for example: https://actiongpt.github.io/
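A minimal sketch of the modularization idea above: each sensory channel is converted to text independently, then the results are fused into one observation string for the language model. All function and parameter names here are hypothetical placeholders, not real Auto-GPT or plug-in APIs; real perception models would replace the stub logic.

```python
# Hypothetical multimodal-to-text pipeline. Each "tokenizer" below stands
# in for a real model (image captioner, speech-to-text, touch encoder).

def describe_image(objects):
    # Placeholder for an object-recognition model whose labels are
    # serialized as text.
    return "sees: " + ", ".join(sorted(objects))

def transcribe_audio(words):
    # Placeholder for a speech-to-text model.
    return "hears: " + " ".join(words)

def tokenize_touch(readings):
    # Placeholder for mapping pressure-sensor readings to words.
    return "feels: " + ", ".join(
        f"{sensor}={'contact' if value > 0.5 else 'no contact'}"
        for sensor, value in sorted(readings.items())
    )

def build_observation(objects, words, readings):
    """Fuse all modalities into one text observation for the LLM loop."""
    return "\n".join([
        describe_image(objects),
        transcribe_audio(words),
        tokenize_touch(readings),
    ])

observation = build_observation(
    objects={"cup", "table"},
    words=["pick", "up", "the", "cup"],
    readings={"left_gripper": 0.9, "right_gripper": 0.1},
)
print(observation)
```

The design choice here is that fusion happens at the text level, so the language model itself needs no architectural changes; the trade-off, as noted in the comments below, is latency and token cost per perception step.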
Examples
No response
Motivation
i-Robot movie