Open harshsikka opened 1 year ago
dumping some things I've come across:
Thanks folks. Now that we're expanding the Survey effort to community, it could be useful to edit the above posts to follow the "Issue format" specified in community guidelines whenever you have a few minutes : https://docs.google.com/document/d/1LPCl8ivbPQsEx96sGBPeCY7AM8PWh7RUcy-TNNJkQ50/edit#heading=h.v41a31azfs4m
Most of the modern agent based literature has been focused on using LLMs as the core controller in an "agent". See ManifoldRG/Manifold-KB#18
This makes sense given the general applicability of language and the representational capacity of LLMs. But, what other kinds of agents are there?
We're looking to collect and understand examples of agents that might be vision input based, or even multimodal agents (language paired w/ other modalities).
Output: Candidate papers for these kinds of agents