You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Thanks! We already looked into it, but currently API access to O1 models is restricted to Tier 5 OpenAI developers. We will look into this as soon as it opens up.
From early manual testing, it seems like O1 is superior to 4O, but we were surprised to see that it doesn't reach maximum score on the few instances we tried.
Request in title. Also, it would be amazing to see the leaderboard somewhere in the github readme. Love your work! :)
The text was updated successfully, but these errors were encountered: