Skyvern-AI / skyvern

Automate browser-based workflows with LLMs and Computer Vision
https://www.skyvern.com
GNU Affero General Public License v3.0
10.57k stars 725 forks source link

Analysis of Skyvern on web agent Benchmarks #1207

Closed devinat1 closed 5 days ago

devinat1 commented 6 days ago

I am curious as to whether the Skyvern agent has been benchmarked on webarena or visualwebarena. It would be very interesting to see how the agent performs on an academic benchmark.

suchintan commented 5 days ago

Great idea. We're planning on doing this this month!