quchangle1 / LLM-Tool-Survey

This is the repository for the Tool Learning survey.
https://arxiv.org/abs/2405.17935
198 stars 8 forks source link

Consider adding AppWorld to the list #6

Closed HarshTrivedi closed 3 weeks ago

HarshTrivedi commented 3 weeks ago

Thanks for setting up this repository! If possible, I would like to add AppWorld to this list, e.g., the Benchmarks section.

🔗 Website: https://appworld.dev/ 📄 Paper: https://arxiv.org/abs/2407.18901 🐦 Tweet: https://x.com/harsh3vedi/status/1818311843976233198 💬 Blog: https://appworld.dev/blog 🎬 Video(s): https://appworld.dev/video 🌎 Code: https://github.com/stonybrooknlp/appworld 🧭 Data (task, trajectories) explorer, playground: https://appworld.dev/task-explorer 🔍 API explorer: https://appworld.dev/api-explorer 📊 Leaderboard: https://appworld.dev/leaderboard

TLDR: Introduces the AppWorld Engine, a high-fidelity execution environment of 9 day-to-day apps, operable via 457 APIs, populated with digital activities of 106 people living in a simulated world, and an associated benchmark of natural, diverse, and challenging autonomous agent tasks requiring rich and interactive coding.

quchangle1 commented 3 weeks ago

I've already added AppWorld! You can find it in the Benchmarks section.

HarshTrivedi commented 3 weeks ago

Awesome! Thank you very much @quchangle1 !!