openai/mle-bench
MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering
https://openai.com/index/mle-bench/
Other
529 stars
59 forks
issues
# | Title | Author | Status | Time | Comments
#21 | Ensure data dir paths are absolute | thesofakillers | closed | 22 hours ago | 0
#20 | Error while preparing `the-icml-2013-whale-challenge-right-whale-redux` | yorhaha | closed | 22 hours ago | 2
#19 | Calculating scores | SamuelSchmidgall | closed | 3 weeks ago | 9
#18 | Adds the grading reports from our paper runs | thesofakillers | closed | 3 weeks ago | 0
#17 | Adds file outlining competition categories | thesofakillers | closed | 1 month ago | 0
#16 | Information request the result of each dataset in the benchmark | boranhan | closed | 1 month ago | 1
#15 | Update make_submission.py | smit23patel | closed | 3 weeks ago | 1
#14 | Cannot download 'detecting-insults-in-social-commentary' data due to the 'late submission' button on Kaggle grey | jessyford | closed | 1 month ago | 6
#13 | Information request about the competitions in each category in Figure 6 | JK-SHIN-PG | closed | 1 month ago | 7
#12 | Improve Efficiency and Error Handling in `mlebench/cli.py` | Mefisto04 | closed | 1 month ago | 2
#11 | Fix spelling mistake | danesherbs | closed | 1 month ago | 0
#10 | Fixed grammatical errors and added emojis for better readability | NishantRana07 | closed | 1 month ago | 1
#9 | Breakdown of which benchmarks were solved in paper | SamuelSchmidgall | closed | 3 weeks ago | 10
#8 | Add citation to README | james-aung | closed | 1 month ago | 0
#7 | docs: update description.md | eltociear | closed | 1 month ago | 0
#6 | Fix typo error for the assertion | Aiden-Jeon | closed | 1 month ago | 0
#5 | Why 82 datasets are in `./mlebench/compositions` folder? | JK-SHIN-PG | closed | 1 month ago | 1
#4 | add MLE-Agent support and test it using MLE-bench | HuaizhengZhang | closed | 1 month ago | 2
#3 | Add arXiv URL | danesherbs | closed | 1 month ago | 0
#2 | Add authors | ojaffe | closed | 1 month ago | 0
#1 | Fix typo | ojaffe | closed | 1 month ago | 0