openai / miniF2F

Formal to Formal Mathematics Benchmark
321 stars 45 forks source link

Exact accuracies on miniF2F be presented more clearly? #123

Open brando90 opened 2 years ago

brando90 commented 2 years ago
DyeKuu commented 2 years ago

Hi Brando! By exact accuracy you means the accuracy breaking down to each statement, or the sota accuracy like the paper you mentioned here?

brando90 commented 2 years ago

yes, like the examples ones I provided. Let me know if you have other thoughts. Thanks!


Brando Miranda Ph.D. Student Computer Science, Stanford University EDGE Scholar, Stanford University @.**@.> website: https://brando90.github.io/brandomiranda/home.html

On Nov 1, 2022, at 6:01 PM, Kunhao ZHENG @.**@.>> wrote:

Hi Brando! By exact accuracy you means the accuracy breaking down to each statement, or the sota accuracy?

— Reply to this email directly, view it on GitHubhttps://github.com/openai/miniF2F/issues/123#issuecomment-1299410674, or unsubscribehttps://github.com/notifications/unsubscribe-auth/AAOE6LXIQKDB2QX2WURWRWLWGG4P5ANCNFSM6AAAAAARTXNYWE. You are receiving this because you authored the thread.Message ID: @.***>

DyeKuu commented 2 years ago

I can provide several paper that I know reporting accuracies on miniF2F, more or less in chronological order. The list may be incomplete and any fix welcome!

As the accuracies (pass-rate) are usually subject to the computation budget and the language. I only put the number on test split here, for the number of validation split it worth taking a look at the details in these paper.