issues
search
potsawee
/
selfcheckgpt
SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models
MIT License
442
stars
54
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Tiny bug: missing import and variable in README.md
#33
annahedstroem
opened
1 day ago
0
Could you provide a example of showing the AUC-PR of your own methods(i.e. the selfcheckgpt) just like the probability-based-baselines?
#32
goatyu3
opened
2 months ago
2
Add support for Groq's API
#31
WladimirLct
closed
3 months ago
0
Added demo notebook to detect hallucination score for a given query and LLM
#30
Kirushikesh
closed
3 months ago
0
What are these 2 numbers means in the example result
#29
orriduck
opened
5 months ago
3
Question about random baseline
#28
WWWonderer
closed
5 months ago
2
Add SelfCheckGPT-prompt with OpenAI API
#27
potsawee
closed
6 months ago
0
Possible annotation errors?
#26
pramitchoudhary
opened
7 months ago
1
Feedback: Adding Notebook for R and S generation by LLM
#25
Kirushikesh
closed
3 months ago
3
Questions about the SelfCheckGPT-NLI and SelfCheckGPT-Prompt
#24
hbr690188270
closed
4 months ago
3
merge LLM prompt implementation
#23
potsawee
closed
7 months ago
0
can't find wiki_bio_test_idx indices in the original wikibio test set
#22
tuvllms
closed
8 months ago
3
about passage-level human annotations
#21
141forever
closed
6 months ago
3
Which version of LLaMA model is used?
#20
Moximixi
closed
9 months ago
1
When I run the latest version of 'probability-baselines.ipynb', there is an error. Could you please provide the full version number of the experimental environment?
#19
Moximixi
closed
10 months ago
3
Can you provide the datasets in the probability-baselines.ipynb file?
#18
Moximixi
closed
10 months ago
2
Contradiction Threshold for NLI Approach
#17
ktangri
closed
1 year ago
2
Could you provide an example of using ngram to predict the factuality of a sentence?
#16
xu1868
opened
1 year ago
1
How to combine the three variants of selfcheckgpt in your paper?
#15
dhx20150812
opened
1 year ago
6
Open Question: Fact Checking LLM
#14
GvdDool
closed
1 year ago
4
Is this method actually useful in the real world?
#13
YooSungHyun
closed
1 year ago
3
How long would it usually take to run all three score?
#12
yihan-zhou
closed
1 year ago
2
What's the meaning of β in Appendix B?
#11
coderlemon17
closed
1 year ago
4
range of selfcheck_bertscore
#10
EngSalem
closed
1 year ago
2
Can you provide the name list that is used for creating the dataset.
#9
soap117
closed
1 year ago
1
Update README.md
#8
adianliusie
closed
1 year ago
0
ngram experiment
#7
potsawee
closed
1 year ago
0
code for selfcheck n-gram
#6
potsawee
closed
1 year ago
0
code for proxy model
#5
zthang
closed
1 year ago
2
Update README.md
#4
potsawee
closed
1 year ago
0
Can the proposed method be used for other domain?
#3
zthang
closed
1 year ago
4
What's the meaning of Non-Factual* in the paper?
#2
skpig
closed
1 year ago
2
Code to reproduce the paper's evaluations
#1
rfriel
closed
1 year ago
1