trishullab/PutnamBench
An evaluation benchmark for undergraduate competition math in Lean4, Isabelle, Coq, and natural language.
56 stars, 8 forks
batches 1962 and 1968 (verify Isabelle and formalize easy in Coq) #153
Closed
leejasper851 closed 3 months ago