Closed haoxiongliu closed 8 months ago
Hi,Haoxiong, Many thanks for your attention. The reason the data we reported and the data you tested differ is that when uploaded to the Hugging Face repository, Hugging Face performs some processing on the model weights. The data we tested locally is consistent with the data we reported.
Hi,Haoxiong, Many thanks for your attention. The reason the data we reported and the data you tested differ is that when uploaded to the Hugging Face repository, Hugging Face performs some processing on the model weights. The data we tested locally is consistent with the data we reported.
Thank you for your reply. I wonder do you know what kind of processing that Hugging Face performs on the model weights? If I know what it is, may be I could try to fix this gap by myself.
I tried run_mistral.sh and get: gsm8k acc==== 0.7376800606520091 MATH acc==== 0.2726
I also tried
and get: gsm8k acc==== 0.7710386656557998 MATH acc==== 0.278
which is also a bit different from the reported 77.7 and 28.2.
I would like to know your opinion on if this is normal and what might be the cause. Thanks!