potsawee / selfcheckgpt

SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models
MIT License

What's the meaning of Non-Factual* in the paper? #2

Closed by skpig 1 year ago

skpig commented 1 year ago

Same as the title. Thanks in advance.

potsawee commented 1 year ago

Hi @skpig

Sorry for the confusion about Non-Factual*. The * denotes "less-trivial hallucination detection": under our setup/definition, the percentage of hallucinated sentences is quite high (in some examples, all sentences are labelled as major-hallucination), so we wanted to see how these methods perform on a more challenging hallucination detection task. You can see how we created and used the "Non-Factual*" labels (human_label_detect_False_h) in this notebook: https://github.com/potsawee/selfcheckgpt/blob/main/demo/experiments/probability-based-baselines.ipynb
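For illustration only, here is a rough sketch of the idea rather than the notebook's actual code. It assumes that "less-trivial" means skipping passages in which every sentence is labelled major-inaccurate, and that a sentence counts as positive only when it is major-inaccurate. The label strings, the score mapping, and the helper name non_factual_star_labels are hypothetical; the linked notebook remains the authoritative reference for how human_label_detect_False_h is built.

```python
# Assumed mapping from annotation strings to scores (not confirmed by the thread).
LABEL_SCORE = {"accurate": 0.0, "minor_inaccurate": 0.5, "major_inaccurate": 1.0}


def non_factual_star_labels(annotations_by_passage):
    """Build sentence-level labels for the harder detection setting.

    Assumption: passages whose sentences are ALL major-inaccurate are dropped,
    because detecting hallucination there is trivial. In the remaining passages
    a sentence is a positive (1) only if it is major-inaccurate.
    """
    labels = {}
    for passage_id, annotations in annotations_by_passage.items():
        scores = [LABEL_SCORE[a] for a in annotations]
        if all(score == 1.0 for score in scores):
            continue  # wholly hallucinated passage: excluded from this setting
        labels[passage_id] = [1 if score == 1.0 else 0 for score in scores]
    return labels


# Toy annotations: passage 0 is entirely hallucinated, passage 1 is mixed.
example = {
    0: ["major_inaccurate", "major_inaccurate"],
    1: ["accurate", "minor_inaccurate", "major_inaccurate"],
}
result = non_factual_star_labels(example)
print(result)  # {1: [0, 0, 1]}
```

Under these assumptions passage 0 is dropped entirely, and only the last sentence of passage 1 is treated as a positive.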

Hope this clarifies your question.
Potsawee

skpig commented 1 year ago

Thanks for your response!