allenai / open-instruct

Apache License 2.0
1.22k stars 166 forks source link

Update TruthfulQA evaluation using retrained LLaMa judge models. #112

Closed yizhongw closed 7 months ago

yizhongw commented 7 months ago

Please find the details about how the llama-based truthfulqa judge models are trained here.