Open adiprasad opened 1 month ago
Trying to reproduce evaluation numbers but not able to.
Ex : For gemma-2-9b, the technical report mentions 68.2 on BBH 3 shot CoT while the open llm leaderboard reported 5.05
Was there any special setting used for evaluations ?
@tilakrayal any update on this ?
Trying to reproduce evaluation numbers but not able to.
Ex : For gemma-2-9b, the technical report mentions 68.2 on BBH 3 shot CoT while the open llm leaderboard reported 5.05
Was there any special setting used for evaluations ?