nasa-jpl / rosa

ROSA 🤖 is an AI Agent designed to interact with ROS1- and ROS2-based robotics systems using natural language queries. ROSA helps robot developers inspect, diagnose, understand, and operate robots.
https://github.com/nasa-jpl/rosa/wiki
Apache License 2.0

[New Feature]: expose callbacks and ainvoke #4

Closed — ndahn closed this 1 week ago

ndahn commented 1 month ago

Describe the feature request

In order for the GUI to stay responsive, deferring long-running tasks is essential. At the moment it is not possible to use ROSA for intermediate updates or to react to errors. It would be nice if the ROSA constructor accepted a `callbacks` parameter to pass on to its AgentExecutor. Exposing the `ainvoke` method would be useful, too.
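A sketch of what that could look like. The `Rosa` constructor signature, the `callbacks` forwarding, and the stub executor below are all assumptions for illustration, not ROSA's actual API; with the real library the executor would be LangChain's AgentExecutor.

```python
import asyncio

class StubAgentExecutor:
    """Stand-in for LangChain's AgentExecutor in this sketch."""
    def __init__(self, callbacks=None):
        self.callbacks = callbacks or []

    async def ainvoke(self, inputs):
        # Notify observers at the start and end of the run.
        for cb in self.callbacks:
            cb.on_chain_start(inputs)
        result = {"output": f"echo: {inputs['input']}"}
        for cb in self.callbacks:
            cb.on_chain_end(result)
        return result

class Rosa:
    """Hypothetical ROSA-like wrapper accepting callbacks."""
    def __init__(self, callbacks=None):
        # Forward callbacks down to the executor so a GUI can observe
        # intermediate events instead of waiting for the final result.
        self._executor = StubAgentExecutor(callbacks=callbacks)

    async def ainvoke(self, query: str):
        return await self._executor.ainvoke({"input": query})

class PrintCallback:
    def on_chain_start(self, inputs):
        print("started:", inputs["input"])
    def on_chain_end(self, result):
        print("finished:", result["output"])

agent = Rosa(callbacks=[PrintCallback()])
print(asyncio.run(agent.ainvoke("list topics"))["output"])
```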

As an aside, I'm not sure if adding double underscores to all class members is really necessary. It makes inheritance and monkey patching much more annoying :)
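For context, the annoyance comes from Python's name mangling: a double-underscore attribute is rewritten to include the defining class's name, so subclasses and monkey patches cannot reach it under its declared name:

```python
class Base:
    def __init__(self):
        self.__secret = 1  # stored as _Base__secret

class Child(Base):
    def peek(self):
        # `self.__secret` here would look up _Child__secret and raise
        # AttributeError; the mangled name must be spelled out instead.
        return self._Base__secret

assert Child().peek() == 1
```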

RobRoyce commented 2 weeks ago

Changed the class methods from private to protected in #9

RobRoyce commented 2 weeks ago

Can you elaborate on the intended use case a bit more? I'd like to understand how the callbacks would be used to keep the UI responsive.

Is it that you want to be able to submit arbitrarily many queries to the agent without waiting for the previous query to finish?

How will you keep track of the queries, their intermediate steps, and results?

Mostly just curious, I think it's a good idea either way.

ndahn commented 2 weeks ago

Sure! Right now I'm running the request inside a thread and updating the UI from there once it returns. There are a couple of things this approach doesn't allow (at least as far as I can tell).

From what I saw, LangChain uses a callback class and its member functions for handling response updates. I would construct one of these as I submit the request, then use a closure over the request and its generated UI elements to keep the association.
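A minimal sketch of that association. The widget class is a hypothetical stand-in for a real GUI element, and the handler method is named after LangChain's `on_llm_new_token` callback; a real handler would subclass LangChain's `BaseCallbackHandler`.

```python
class FakeLabel:
    """Hypothetical GUI widget stand-in."""
    def __init__(self):
        self.text = ""
    def set_text(self, text):
        self.text = text

class RequestCallback:
    """One handler per request, bound to that request's UI element."""
    def __init__(self, label):
        self._label = label

    def on_llm_new_token(self, token):
        # Append streamed tokens to the widget for this request only.
        self._label.set_text(self._label.text + token)

label = FakeLabel()
handler = RequestCallback(label)
for tok in ["Hello", ", ", "world"]:
    handler.on_llm_new_token(tok)
print(label.text)  # → Hello, world
```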

RobRoyce commented 2 weeks ago

Check out the astream implementation in rosa.py on the feature/streaming branch. It uses LangChain's astream_events method, which provides access to streaming tokens, tool usage, and more.

Also see the turtle_agent demo script for example usage.
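For readers without the branch checked out, the consumption pattern looks roughly like this. The agent here is a stub async generator that mimics the `event`/`data` shape of events from LangChain's astream_events, not the real API; with ROSA you would iterate the actual stream instead.

```python
import asyncio

async def stub_astream_events(query):
    """Stand-in for an astream_events-style event stream."""
    yield {"event": "on_tool_start", "name": "list_topics", "data": {}}
    yield {"event": "on_chat_model_stream", "data": {"chunk": "two "}}
    yield {"event": "on_chat_model_stream", "data": {"chunk": "topics"}}
    yield {"event": "on_chain_end", "data": {"output": "two topics"}}

async def run(query):
    tokens = []
    async for ev in stub_astream_events(query):
        if ev["event"] == "on_chat_model_stream":
            tokens.append(ev["data"]["chunk"])  # stream tokens to the GUI
        elif ev["event"] == "on_tool_start":
            print("tool:", ev["name"])          # surface tool usage
    return "".join(tokens)

print(asyncio.run(run("how many topics?")))  # → two topics
```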

One thing I noticed, and have yet to find a solution for, is that LangChain's AgentExecutor cannot be preempted or cancelled with standard LangChain facilities. This will likely require asyncio tasks. What is unclear to me at the moment is whether ROSA should provide that integration or leave it to the consumer (I am leaning toward the latter).
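A minimal sketch of the asyncio-task approach: wrap the agent coroutine in a task and cancel it on demand. The slow coroutine below is a stand-in for an actual agent run.

```python
import asyncio

async def slow_query():
    # Stand-in for a long-running agent invocation.
    await asyncio.sleep(10)
    return "done"

async def main():
    task = asyncio.create_task(slow_query())
    await asyncio.sleep(0.01)  # e.g. the user hits "cancel" in the GUI
    task.cancel()
    try:
        await task
    except asyncio.CancelledError:
        return "cancelled"

print(asyncio.run(main()))  # → cancelled
```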

ndahn commented 2 weeks ago

Thank you! I'll have some time next week to test it out. I agree, though, that supporting request cancellation sounds like something LangChain should provide.

ndahn commented 1 week ago

Tried it out just now and it works like a charm! The example was already very helpful. Seems like you had some fun with it :D Thank you!