clp-research / clembench

A Framework for the Systematic Evaluation of Chat-Optimized Language Models as Conversational Agents and an Extensible Benchmark
MIT License
19 stars 26 forks source link

Feat/refactor model pairing #35

Closed phisad closed 5 months ago

phisad commented 5 months ago

As discussed in https://github.com/clp-research/clembench/issues/34