openai mle-bench issues

openai / mle-bench

MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering.

https://openai.com/index/mle-bench/

529 stars, 59 forks

issues

Ensure data dir paths are absolute

#21 thesofakillers closed 22 hours ago
0
Error while preparing `the-icml-2013-whale-challenge-right-whale-redux`

#20 yorhaha closed 22 hours ago
2
Calculating scores

#19 SamuelSchmidgall closed 3 weeks ago
9
Adds the grading reports from our paper runs

#18 thesofakillers closed 3 weeks ago
0
Adds file outlining competition categories

#17 thesofakillers closed 1 month ago
0
Information request the result of each dataset in the benchmark

#16 boranhan closed 1 month ago
1
Update make_submission.py

#15 smit23patel closed 3 weeks ago
1
Cannot download 'detecting-insults-in-social-commentary' data due to the 'late submission' button on Kaggle grey

#14 jessyford closed 1 month ago
6
Information request about the competitions in each category in Figure 6

#13 JK-SHIN-PG closed 1 month ago
7
Improve Efficiency and Error Handling in `mlebench/cli.py`

#12 Mefisto04 closed 1 month ago
2
Fix spelling mistake

#11 danesherbs closed 1 month ago
0
Fixed grammatical errors and added emojis for better readability

#10 NishantRana07 closed 1 month ago
1
Breakdown of which benchmarks were solved in paper

#9 SamuelSchmidgall closed 3 weeks ago
10
Add citation to README

#8 james-aung closed 1 month ago
0
docs: update description.md

#7 eltociear closed 1 month ago
0
Fix typo error for the assertion

#6 Aiden-Jeon closed 1 month ago
0
Why 82 datasets are in `./mlebench/compositions` folder?

#5 JK-SHIN-PG closed 1 month ago
1
add MLE-Agent support and test it using MLE-bench

#4 HuaizhengZhang closed 1 month ago
2
Add arXiv URL

#3 danesherbs closed 1 month ago
0
Add authors

#2 ojaffe closed 1 month ago
0
Fix typo

#1 ojaffe closed 1 month ago
0