Closed yiqingxyq closed 4 months ago
Hi Yiqing,
The claims and documents from the benchmark are collected from the previous work:
You can find more detailed description of each dataset in Appendix C of our work.
We are planning to include one additional dataset into the benchmark in a week or so. Stay tuned!
Hi!
How did you generate the claims in the benchmark? Did you (1) directly prompt the model to generate a claim, (2) first generate model responses and then decompose them into claims, and (3) use other ways?
Thanks!