SkyworkAI / agent-studio

Benchmarks, environments, and toolkits for general computer agents
https://skyworkai.github.io/agent-studio/
GNU Affero General Public License v3.0
153 stars 11 forks source link

Add dataset evaluation script & local annotator #33

Closed ltzheng closed 5 months ago