bigcode-project / Megatron-LM

Ongoing research training transformer models at scale

Incomplete humaneval evaluation code #56

Closed huybery closed 1 year ago

huybery commented 1 year ago

Hi @RaymondLi0, in the evaluation branch I found the evaluation code for HumanEval, but unfortunately it doesn't work. Some specific mismatches are as follows.

https://github.com/bigcode-project/Megatron-LM/blob/bd14566aa064afb60b6a0e6bf323c6214a2ca093/tasks/human_eval/generate_samples.py#L161

prefix_lm and sep_in_bidirectional_context are not arguments of generate_and_post_process.
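A mismatch like this can be confirmed without running the full script. Below is a minimal sketch that compares the keyword arguments of a call against a function's signature. The stub function and its parameters are hypothetical stand-ins, not the real Megatron-LM signature of generate_and_post_process; in practice you would pass the imported function instead.

```python
import inspect


def unsupported_kwargs(func, kwargs):
    """Return the names in kwargs that func cannot accept as keywords."""
    params = inspect.signature(func).parameters
    # A **kwargs parameter accepts any keyword, so nothing is unsupported.
    if any(p.kind is inspect.Parameter.VAR_KEYWORD for p in params.values()):
        return []
    # Only these parameter kinds can be passed by keyword.
    keyword_kinds = (
        inspect.Parameter.POSITIONAL_OR_KEYWORD,
        inspect.Parameter.KEYWORD_ONLY,
    )
    accepted = {name for name, p in params.items() if p.kind in keyword_kinds}
    return [name for name in kwargs if name not in accepted]


# Hypothetical stand-in for illustration only; the real function's
# parameters are not shown in this issue and may differ.
def generate_and_post_process_stub(model, prompts=None, tokens_to_generate=0):
    return prompts


# Keyword arguments similar to what the evaluation script is described as passing.
call_kwargs = {
    "prompts": ["def f():"],
    "tokens_to_generate": 8,
    "prefix_lm": False,
    "sep_in_bidirectional_context": False,
}

bad = unsupported_kwargs(generate_and_post_process_stub, call_kwargs)
print(bad)  # ['prefix_lm', 'sep_in_bidirectional_context']
```

Running the same check against the actual function on the evaluation branch would list exactly the arguments that need to be removed from the call or added to the function.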

Could you help me update it?