Hi, I'm trying to evaluate the performance of llama2-7b on math datasets. However, I found that the format of the output differs from one prediction to the next, so it is not easy to extract only the final answer. Is there a way to handle this?
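For reference, one common way to approach this kind of extraction is sketched below. It is only an illustration under assumptions: it supposes the prompt asks the model to finish with a fixed phrase such as "The answer is", and it falls back to the last number in the output when that phrase is missing. The helper name `extract_answer`, the marker phrase, and the fallback rule are all hypothetical and not part of llama2 or any evaluation library.

```python
import re

# Hypothetical helper: the name, the marker phrase, and the fallback rule are
# illustrative assumptions, not part of llama2 or any evaluation library.

# A number with an optional sign, thousands separators, and decimal part.
NUMBER = r"-?\d[\d,]*(?:\.\d+)?"


def extract_answer(text, marker="The answer is"):
    """Return the predicted number as a string, or None if no number is found."""
    # Prefer a number that directly follows the marker phrase.
    pattern = re.escape(marker) + r"\s*\$?\s*(" + NUMBER + ")"
    match = re.search(pattern, text, re.IGNORECASE)
    if match:
        candidate = match.group(1)
    else:
        # Fall back to the last number anywhere in the output.
        numbers = re.findall(NUMBER, text)
        if not numbers:
            return None
        candidate = numbers[-1]
    # Normalize by dropping thousands separators.
    return candidate.replace(",", "")


def is_correct(prediction, gold, tol=1e-6):
    """Compare a model output with a gold numeric answer within a tolerance."""
    answer = extract_answer(prediction)
    return answer is not None and abs(float(answer) - float(gold)) <= tol
```

Outputs where no number can be found return `None`, so they can be counted separately rather than silently scored as wrong. The fallback to the last number is a heuristic and can pick up an intermediate value, so it is worth spot-checking a sample of extractions by hand.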