Closed loubnabnl closed 1 year ago
This PR adds options to use the evaluation harness to do text generation only or evaluation only (on previously computed generations). It also adds a CI and tests for HumanEval and MBPP.
cc @ocramz
This PR adds options to use the evaluation harness to do text generation only or evaluation only (on previously computed generations). It also adds a CI and tests for HumanEval and MBPP.
cc @ocramz