bigcode-project / bigcode-evaluation-harness

A framework for the evaluation of autoregressive code generation language models.
Apache License 2.0
702 stars 180 forks source link

Why the result of using multiple evaluation is 0 #188

Closed shuaiwang2022 closed 1 week ago

shuaiwang2022 commented 5 months ago

image

my settings image

loubnabnl commented 5 months ago

Are you running the execution inside the docker image we provide for MultiPL-E? as it requires extra dependencies