e2b-dev / awesome-ai-sdks

A database of SDKs, frameworks, libraries, and tools for creating, monitoring, debugging and deploying autonomous AI agents
742 stars 62 forks source link

Add ToolEmu #8

Open tizkovatereza opened 9 months ago

tizkovatereza commented 9 months ago

ToolEmu

An LLM-based emulation framework for flexibly testing and identifying the risks of LLM-based agents across various tools & scenarios

### Description - ToolEmu leverages advanced LLMs (like GPT-4) as an emulator to emulate tool execution and automatically instantiate scenarios for risk assessment in a virtual sandbox - ToolEmu enables: - flexibly prototyping LLM-based agents equipped with tools without the need of actual tool implementations - seamlessly testing LLM-based agents in rare and risk-critical scenarios without the need of actual sandbox setups - identifying potential realistic failures of LLM-based agents ### Links - [GitHub](https://github.com/ryoungj/ToolEmu) - [Website](https://toolemu.com/) - [Demo](https://demo.toolemu.com/) - [Paper](https://arxiv.org/abs/2309.15817) - [Tweet](https://twitter.com/YangjunR/status/1708880142649676056)