eidolon-ai / eidolon

The first AI Agent Server, Eidolon is a pluggable Agent SDK and enterprise ready, deployment server for Agentic applications
https://www.eidolonai.com/
Apache License 2.0
285 stars 31 forks source link

Interactive browser sessions #928

Open flynntsang opened 2 days ago

flynntsang commented 2 days ago

As an automation engineer I want autonomous agents to perform online work that otherwise require human intervention or a lot of custom code.

This may require a long-lived session in cases where state is built up over time (ie, in-browser manipulation, authentication, etc).

Configuration

This likely involves exposing playwright more or less directly to the agent. Perhaps with scoped url allowances. TODO investigate what this actually looks like

We need to look at playwright api to understand how external navigation / popups work. Presumably the agent should be able to navigate these as well as long as it is allowed by url rules.

When continuing session we need ways for the agent to get the current state and manipulate the DOM. This is likely a passthrough of the playwright api.

File downloads need to be handled gracefully in a way accessible to the agent / user, so, for example, they can be uploaded to external locations. Hooking into the existing "files" api is likely graceful way to handle this.

Cases to consider