openai / openai-cookbook

Examples and guides for using the OpenAI API
https://cookbook.openai.com
MIT License
58.83k stars 9.34k forks source link

Advanced Evals: Combining Multiple Annotators of Varying Quality #1379

Open nelsonauner opened 1 month ago

nelsonauner commented 1 month ago

Summary

This summary adds a notebook for dealing with multiple annotators, partially replicating the seminal MT-Bench paper while using OpenAI's new structured outputs and token probability features unavailable in other LLMs

Motivation

There currently is no cookbook on dealing with combining a large number of human annotators with LLM-As-Judge, even though this practice is quite common.


For new content

When contributing new content, read through our contribution guidelines, and mark the following action items as completed:

We will rate each of these areas on a scale from 1 to 4, and will only accept contributions that score 3 or higher on all areas. Refer to our contribution guidelines for more details.

nelsonauner commented 3 weeks ago

@pap-openai Anything I can do to help get this reviewed (or close it out?)

pap-openai commented 2 weeks ago

@pap-openai Anything I can do to help get this reviewed (or close it out?)

Hi @nelsonauner, we're currently working on a contributor license agreement and will come back a bit later once we've figured that out