clp-research / clembench

A Framework for the Systematic Evaluation of Chat-Optimized Language Models as Conversational Agents and an Extensible Benchmark
MIT License

wordle game should use standard method for checking whether it runs over max context size #23

Open davidschlangen opened 7 months ago

davidschlangen commented 7 months ago

For this, see the other issue on adding a method for getting the actual context size from the backend.

davidschlangen commented 7 months ago

See #22