microsoft / LLF-Bench

A benchmark for evaluating learning agents based on just language feedback
https://microsoft.github.io/LLF-Bench/
MIT License
60 stars 12 forks source link