Open 2proveit opened 1 year ago
Thanks for your great work in creating this dataset, I have questions while evaluating llama2-7b-chat on this dataset.
llama2-7b-chat
def acc(eval_preds:EvalPrediction): logits, labels = eval_preds preds = tokenizer.batch_decode(logits, skip_special_tokens=True) labels = tokenizer.batch_decode(labels, skip_special_tokens=True) save_results(preds, labels) # save results to json file preds = [last_boxed_only_string(s) for s in preds] correct = 0 total = 0 for pred, label in zip(preds, labels): if is_equiv(pred, label): correct += 1 total += 1 return {"accuracy": correct / total} return acc
Thanks for your great work in creating this dataset, I have questions while evaluating
llama2-7b-chat
on this dataset.llama2-7b-chat
output remains 0 when the training goes, here is my code: