openai / evals

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
Other
14.76k stars 2.58k forks source link

Remove citation prediction eval #1512

Closed ojaffe closed 6 months ago

ojaffe commented 6 months ago

@JunShern will review this

Removed broken Citation Prediction eval.