add MLE-Agent support and test it using MLE-bench

openai / mle-bench

MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering

Other

422 stars 39 forks source link

Closed HuaizhengZhang closed 1 week ago

HuaizhengZhang commented 1 week ago

Thanks for the great work!!!

We develop an MLE-agent, which includes a mle kaggle command.

We plan to test the MLE-agent with your MLE-bench and provide the evaluation results.

cc @huangyz0918

ojaffe commented 1 week ago

Exciting! Good luck MLE-agent

danesherbs commented 1 week ago

Closing this since it's a TODO on the MLSysOps/MLE-agent repo :)