shmsw25 / FActScore

A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation"
https://arxiv.org/abs/2305.14251
MIT License

Instruct LLAMA integration #9

martiansideofthemoon closed 1 year ago

martiansideofthemoon commented 1 year ago

LLAMA weight differences ---
sum: tensor(0.0013)
average: tensor(1.3684e-13)

As a baseline, the weight differences between LLAMA and Inst-LLAMA:
sum: tensor(7083599.5000)
avg: tensor(0.0005)
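
For reference, here is a minimal sketch of how a sum/average comparison like the one above could be computed. It is not the script that produced these numbers: the function name `weight_difference` is hypothetical, the metric is assumed to be the absolute element-wise difference, and plain Python lists stand in for the flattened tensors of a real checkpoint.

```python
from typing import Dict, Sequence, Tuple


def weight_difference(
    a: Dict[str, Sequence[float]],
    b: Dict[str, Sequence[float]],
) -> Tuple[float, float]:
    """Return (sum, average) of absolute element-wise differences.

    Assumption: each checkpoint is a mapping from parameter name to a
    flat sequence of floats (a stand-in for flattened tensors).
    """
    if a.keys() != b.keys():
        raise ValueError("checkpoints have different parameter names")

    total = 0.0
    count = 0
    for name, values_a in a.items():
        values_b = b[name]
        if len(values_a) != len(values_b):
            raise ValueError(f"size mismatch for parameter {name!r}")
        # Accumulate |a - b| over every element of this parameter.
        total += sum(abs(x - y) for x, y in zip(values_a, values_b))
        count += len(values_a)

    average = total / count if count else 0.0
    return total, average


# Toy checkpoints (made-up values, purely illustrative).
reference = {"layer.weight": [0.5, -1.0, 2.0], "layer.bias": [0.25]}
recovered = {"layer.weight": [0.5, -1.0, 2.0], "layer.bias": [0.25]}
finetuned = {"layer.weight": [0.75, -1.0, 1.5], "layer.bias": [0.25]}

print("recovered vs reference:", weight_difference(recovered, reference))
print("finetuned vs reference:", weight_difference(reference, finetuned))
```

A near-zero sum for the recovered weights next to a much larger sum for the baseline pair is the pattern to look for; the absolute values depend on the parameter count and numeric precision.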