openai / mle-bench

MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering

add MLE-Agent support and test it using MLE-bench #4

Closed HuaizhengZhang closed 1 week ago

HuaizhengZhang commented 1 week ago

Thanks for the great work!!!

We have developed MLE-agent, which includes an mle kaggle command.

https://github.com/MLSysOps/MLE-agent?tab=readme-ov-file

We plan to test the MLE-agent with your MLE-bench and provide the evaluation results.

cc @huangyz0918

ojaffe commented 1 week ago

Exciting! Good luck, MLE-agent!

danesherbs commented 1 week ago

Closing this since it's a TODO on the MLSysOps/MLE-agent repo :)