clp-research / clembench

A Framework for the Systematic Evaluation of Chat-Optimized Language Models as Conversational Agents and an Extensible Benchmark
MIT License
19 stars 26 forks source link

multilingual referencegame #69

Closed AnneBeyer closed 3 months ago

AnneBeyer commented 3 months ago

Adapt instance generation and add multilingual instances.