openai / mle-bench

MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering
https://openai.com/index/mle-bench/
Other
529 stars, 59 forks

Adds file outlining competition categories #17

Closed by thesofakillers 1 month ago

thesofakillers commented 1 month ago

Necessary for Figure 6 of the paper.

Fixes #13