potsawee selfcheckgpt issues

potsawee / selfcheckgpt

SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models

MIT License

442 stars 54 forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

Tiny bug: missing import and variable in README.md

#33 annahedstroem opened 1 day ago
0
Could you provide a example of showing the AUC-PR of your own methods(i.e. the selfcheckgpt) just like the probability-based-baselines?

#32 goatyu3 opened 2 months ago
2
Add support for Groq's API

#31 WladimirLct closed 3 months ago
0
Added demo notebook to detect hallucination score for a given query and LLM

#30 Kirushikesh closed 3 months ago
0
What are these 2 numbers means in the example result

#29 orriduck opened 5 months ago
3
Question about random baseline

#28 WWWonderer closed 5 months ago
2
Add SelfCheckGPT-prompt with OpenAI API

#27 potsawee closed 6 months ago
0
Possible annotation errors?

#26 pramitchoudhary opened 7 months ago
1
Feedback: Adding Notebook for R and S generation by LLM

#25 Kirushikesh closed 3 months ago
3
Questions about the SelfCheckGPT-NLI and SelfCheckGPT-Prompt

#24 hbr690188270 closed 4 months ago
3
merge LLM prompt implementation

#23 potsawee closed 7 months ago
0
can't find wiki_bio_test_idx indices in the original wikibio test set

#22 tuvllms closed 8 months ago
3
about passage-level human annotations

#21 141forever closed 6 months ago
3
Which version of LLaMA model is used?

#20 Moximixi closed 9 months ago
1
When I run the latest version of 'probability-baselines.ipynb', there is an error. Could you please provide the full version number of the experimental environment?

#19 Moximixi closed 10 months ago
3
Can you provide the datasets in the probability-baselines.ipynb file?

#18 Moximixi closed 10 months ago
2
Contradiction Threshold for NLI Approach

#17 ktangri closed 1 year ago
2
Could you provide an example of using ngram to predict the factuality of a sentence?

#16 xu1868 opened 1 year ago
1
How to combine the three variants of selfcheckgpt in your paper?

#15 dhx20150812 opened 1 year ago
6
Open Question: Fact Checking LLM

#14 GvdDool closed 1 year ago
4
Is this method actually useful in the real world?

#13 YooSungHyun closed 1 year ago
3
How long would it usually take to run all three score?

#12 yihan-zhou closed 1 year ago
2
What's the meaning of β in Appendix B?

#11 coderlemon17 closed 1 year ago
4
range of selfcheck_bertscore

#10 EngSalem closed 1 year ago
2
Can you provide the name list that is used for creating the dataset.

#9 soap117 closed 1 year ago
1
Update README.md

#8 adianliusie closed 1 year ago
0
ngram experiment

#7 potsawee closed 1 year ago
0
code for selfcheck n-gram

#6 potsawee closed 1 year ago
0
code for proxy model

#5 zthang closed 1 year ago
2
Update README.md

#4 potsawee closed 1 year ago
0
Can the proposed method be used for other domain?

#3 zthang closed 1 year ago
4
What's the meaning of Non-Factual* in the paper?

#2 skpig closed 1 year ago
2
Code to reproduce the paper's evaluations

#1 rfriel closed 1 year ago
1