abacaj / code-eval

Run evaluation on LLMs using human-eval benchmark
MIT License
362 stars 34 forks source link

where can i get the result of perfrmance after evaluating the llama2-7b #15

Closed qxpBlog closed 6 months ago

qxpBlog commented 6 months ago

@abacaj after evaluating the llama2-7b. i only get a file eval.json: image so how can i get the result following: image