Closed chunniunai220ml closed 1 month ago
Here is the output. We use greedy decoding, and the score may vary a little due to the precision problem. @chunniunai220ml
Here is the output. We use greedy decoding, and the score may vary a little due to the precision problem. @chunniunai220ml
can u provide example code with LEVAL? like https://github.com/OpenLMLab/LEval/blob/main/Baselines/llama2-chat-test.py, i modified the file to adapt QWEN1.5-CHAT-7B, but failed to reproduce your results
@hzhwcmhf hi, any further infomation for example code with LEVAL
This issue has been automatically marked as inactive due to lack of recent activity. Should you believe it remains unresolved and warrants attention, kindly leave a comment on this thread.
how to reproduce QWEN1.5-7B-CHAT results as you report:
i got to TOEFL=30.198 by https://github.com/OpenLMLab/LEval as u mentioned