trishullab / PutnamBench

An evaluation benchmark for undergraduate competition math in Lean4, Isabelle, Coq, and natural language.
56 stars 8 forks source link

batches 1962 and 1968 (verify Isabelle and formalize easy in Coq) #153

Closed leejasper851 closed 3 months ago