openai / evals

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
Other
14.35k stars 2.54k forks source link

AnthropicSolver #1498

Closed thesofakillers closed 3 months ago

thesofakillers commented 3 months ago

This PR contributes an AnthropicSolver class, a solver for using models served by the Anthropic Claude API, such as claude 3.

Besides basic functionality, the solver provides the following features

Notes: