abacaj / code-eval

Run evaluation on LLMs using human-eval benchmark
MIT License
362 stars 34 forks source link

add extra models #8

Closed abacaj closed 1 year ago