CarperAI / cheese

Used for adaptive human in the loop evaluation of language and embedding models.
MIT License
303 stars 24 forks source link

General Model Upgrades #31

Closed shahbuland closed 2 years ago

shahbuland commented 2 years ago

Added many upgrades for using models with CHEESE, as well as several other QOL changes. Of note is that CHEESE now supports offline evaluation (i.e. cases with no human in the loop). Some examples of where this would be useful could be to evaluate one model with another, or to score model outputs in some automated way. There's some kinks in how this works as a consequence of how rabbitmq works, but these are outlined in examples and documentation.

API:

Pipelines:

Model: