shmsw25 / FActScore

A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation"
https://arxiv.org/abs/2305.14251
MIT License

Using unlabeled data to generate atomic facts and retrieving evidence #22

Closed rubaha96 closed 1 year ago

rubaha96 commented 1 year ago

Hello! I have a few questions.

1) Do I understand correctly that in the current pipeline, atomic fact generation for unlabeled data uses both the unlabeled data itself and the demos data, even when we only need to process the unlabeled data? Why is the demos data always used in these computations?

2) Did you notice that the retrieved evidence is the same for all atomic facts within a particular bio? It seems it should depend on each individual atomic fact.

shmsw25 commented 1 year ago

Thank you for your question.

  1. The demos data consists of demonstrations (for in-context learning) for the Atomic Fact Generator. So it is part of the input the model conditions on, rather than test data. The annotations in the demos data are written by humans.
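
     A minimal sketch of how such in-context demonstrations can be prepended to the prompt when asking an LM to decompose a sentence into atomic facts. The helper name, prompt wording, and demo examples below are illustrative assumptions, not the repository's actual code:

     ```python
     # Hypothetical sketch: build a few-shot prompt from human-written demos.
     def build_atomic_fact_prompt(demos, sentence):
         """demos: list of (sentence, [atomic facts]) pairs written by humans."""
         parts = []
         for demo_sent, facts in demos:
             parts.append(f"Breakdown the following sentence into independent facts: {demo_sent}")
             parts.extend(f"- {fact}" for fact in facts)
             parts.append("")  # blank line between demonstrations
         # The actual test sentence comes last; the LM completes the fact list.
         parts.append(f"Breakdown the following sentence into independent facts: {sentence}")
         return "\n".join(parts)

     demos = [("He was born in 1950 in Paris.",
               ["He was born in 1950.", "He was born in Paris."])]
     prompt = build_atomic_fact_prompt(demos, "She won two awards in 2010.")
     ```

     The demos thus shape the format and granularity of the model's output without ever being evaluated themselves.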
  2. We validate each atomic fact against the Wikipedia article about the subject, so retrieval is restricted to that single article. Please refer to the implementation details in Section 4.1.2 of the paper. It is possible to relax this restriction, but we empirically found this setup to work better.
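
     To illustrate the second point, here is a toy sketch (assumed, not the package's actual retriever) of retrieval restricted to one article's passages, where each atomic fact still ranks passages independently, so different facts can surface different evidence from the same article:

     ```python
     # Hypothetical sketch: per-fact retrieval within a single Wikipedia article,
     # using simple word overlap as a stand-in for the real retriever's scoring.
     def retrieve_for_fact(fact, article_passages, top_k=3):
         """Rank the subject's article passages by word overlap with the fact."""
         fact_words = set(fact.lower().split())
         scored = sorted(
             article_passages,
             key=lambda passage: len(fact_words & set(passage.lower().split())),
             reverse=True,
         )
         return scored[:top_k]

     passages = [
         "Marie Curie was born in Warsaw in 1867.",
         "She received the Nobel Prize in Physics in 1903.",
     ]
     evidence = retrieve_for_fact("She was born in Warsaw.", passages, top_k=1)
     ```

     Even though the candidate pool is the same single article for every fact in a bio, the top-ranked evidence differs per fact.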