openai / mle-bench

MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering
Other
422 stars 39 forks source link

Add arXiv URL #3

Closed danesherbs closed 1 week ago

danesherbs commented 1 week ago

Adds arXiv URL to the README.