Open Belzedar94 opened 1 month ago
Thanks! We already looked into it, but currently API access to O1 models is restricted to Tier 5 OpenAI developers. We will look into this as soon as it opens up. From early manual testing, it seems like O1 is superior to 4O, but we were surprised to see that it doesn't reach maximum score on the few instances we tried.
Hey! Do you have an update on this? Also, I'd love to see a leaderboard somewhere. Thanks a lot and keep up the good work!!
Request in title. Also, it would be amazing to see the leaderboard somewhere in the github readme. Love your work! :)