clp-research / clembench

A Framework for the Systematic Evaluation of Chat-Optimized Language Models as Conversational Agents and an Extensible Benchmark
MIT License

Merging clembench v1.0 #53

Closed: lpfennigschmidt closed this 9 months ago