landing-ai / vision-agent

Vision agent
Apache License 2.0
930 stars 94 forks source link

Vision Agent v3 #89

Closed dillonalaird closed 1 month ago

dillonalaird commented 1 month ago

A couple of pros and cons with Data Interpreter and Agent Coder lead to this design.

Data Interpreter Cons:

Data Interpreter Pros:

Agent Coder Pros:

Agent Coder Cons:

This version of vision agent basically keeps planning, but does the entire plan in one shot using the Agent Coder framework. It uses the plan to do tool recommendation and also allows for long term memory lookup. For planning beyond the initial plan it will do reflection and see if it needs to execute an additional plan.