ServiceNow / AgentLab

Study to multi eval #126

Closed recursix closed 2 weeks ago

recursix commented 2 weeks ago

Adds sequential studies so that multiple agents can be launched one after another on WebArena, with resets.

Also adds an `AbstractStudy` class to define the API and extract reusable code.
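
A rough sketch of the idea, not the code in this PR: only the name `AbstractStudy` comes from the description above. `SingleAgentStudy`, `SequentialStudies`, the `run()` method, the `reset` callable, and the choice to reset before each study are all assumptions made for illustration.

```python
from abc import ABC, abstractmethod
from dataclasses import dataclass
from typing import Callable, List


class AbstractStudy(ABC):
    """Shared interface for studies (method name `run` is an assumption)."""

    @abstractmethod
    def run(self) -> List[str]:
        """Run the study and return a log of what happened."""


@dataclass
class SingleAgentStudy(AbstractStudy):
    """Hypothetical study that evaluates one agent on one benchmark."""

    agent_name: str
    benchmark: str = "webarena"

    def run(self) -> List[str]:
        # A real study would launch experiments here; this only records the call.
        return [f"run {self.agent_name} on {self.benchmark}"]


@dataclass
class SequentialStudies(AbstractStudy):
    """Hypothetical wrapper that runs studies one at a time with a reset before each."""

    studies: List[AbstractStudy]
    reset: Callable[[], str]  # stand-in for a benchmark reset hook

    def run(self) -> List[str]:
        log: List[str] = []
        for study in self.studies:
            # Assumption: the benchmark is reset before each agent's run.
            log.append(self.reset())
            log.extend(study.run())
        return log


def fake_reset() -> str:
    """Placeholder reset; a real one would restore the benchmark's state."""
    return "reset benchmark"


sequential = SequentialStudies(
    studies=[SingleAgentStudy("agent_a"), SingleAgentStudy("agent_b")],
    reset=fake_reset,
)
log = sequential.run()
print(log)
```

Because the wrapper implements the same interface as a single study, callers can launch one study or a sequence of them the same way.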