Multilingual referencegame and imagegame (adjustments)

clp-research / clembench

A Framework for the Systematic Evaluation of Chat-Optimized Language Models as Conversational Agents and an Extensible Benchmark

MIT License

26 stars 34 forks source link

Closed SandraNeuhaeus closed 1 week ago