clp-research / clembench

A Framework for the Systematic Evaluation of Chat-Optimized Language Models as Conversational Agents and an Extensible Benchmark
MIT License

[framework] rethink response parsing #86

Open davidschlangen opened 2 months ago

davidschlangen commented 2 months ago

Do we want to centralise response parsing?

(placeholder, add description)

Related to #27.