clp-research / clembench

A Framework for the Systematic Evaluation of Chat-Optimized Language Models as Conversational Agents and an Extensible Benchmark
MIT License
19 stars 26 forks source link

Model additions #97

Open Gnurro opened 1 week ago

Gnurro commented 1 week ago

Adds Qwen2-7B-Instruct, Qwen2-72B-Instruct, Aya-23-8B and Aya-23-35B to the model registry. Increments required transformers version to 4.41.2 to support Phi-3 models.

Gnurro commented 4 days ago

Also adds Gemma-2 models to the registry, and updates requirements needed to run them.