mnotgod96 / AppAgent

AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.
https://appagent-official.github.io/
MIT License
4.84k stars 511 forks source link

How to use Gemini vision Pro instead of GPT 4!? #65

Open Abhay-404 opened 5 months ago