trishullab / PutnamBench

An evaluation benchmark for undergraduate competition math in Lean4, Isabelle, Coq, and natural language.

Rebase branch #172

Open · michelled01 opened 1 month ago