scicode-bench / SciCode

A benchmark that challenges language models to code solutions for scientific problems
Apache License 2.0
83 stars, 8 forks

o1 models results #17

Closed: stalkermustang closed this issue 2 weeks ago

stalkermustang commented 3 weeks ago

🙏🙏 please, we need to know :)

mtian8 commented 2 weeks ago

We have added o1 results here: https://github.com/scicode-bench/SciCode?tab=readme-ov-file#-leaderboard