hiyouga / LLaMA-Factory

Unify Efficient Fine-Tuning of 100+ LLMs
Apache License 2.0
25.26k stars 3.13k forks source link

Low MMLU of llama2 #4436

Closed Zkli-hub closed 4 days ago

Zkli-hub commented 5 days ago

Reminder

System Info

Reproduction

CUDA_VISIBLE_DEVICES=0 llamafactory-cli eval --model_name_or_path PATH --template fewshot --task mmlu --split test --lang en --n_shot 5 --batch_size 1

Expected behavior

Evaluate the MMLU performance of llama2-7b-base, but the MMLU results is lower than the original paper which is 45.3 image

Others

No response

hiyouga commented 4 days ago

A small derivation is acceptable and usual