Open pengyxhack opened 7 months ago
Hello, this is a very interesting work! May I ask if it's possible to provide the test set for evaluating the accuracy of the evaluator? Thank you very much.
Hi, @pengyxhack ! We directly evaluated our evaluator on the PopQA test set in our paper. You can refer to Self-RAG for more details.
Hello, this is a very interesting work! May I ask if it's possible to provide the test set for evaluating the accuracy of the evaluator? Thank you very much.