ServiceNow / WorkArena

WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks?
https://servicenow.github.io/WorkArena/

Is the reproducible experimental code published? #6

Closed Iriseve closed 5 months ago

Iriseve commented 6 months ago

Hello, thank you very much for the benchmark you have proposed! I would like to ask whether the baseline experimental code for the LLM-based experiments in the paper has been published.

recursix commented 6 months ago

Thanks for reaching out. The base agent has been published with BrowserGym, along with some of the code to run it. A broader codebase covering the wider set of experiments and reproducibility will be released in a few months.