issues
search
vectara
/
hallucination-leaderboard
Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents
https://vectara.com
Apache License 2.0
1.25k
stars
50
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Removed cohere command r 8bitq
#31
mbae26
closed
8 months ago
0
Added cohere's command r model
#30
mbae26
closed
8 months ago
0
Update Claude3 opus and sonnet
#29
mbae26
closed
9 months ago
0
Fix link to csv in README.
#28
gueraf
closed
9 months ago
0
Added gemma-7b-it
#27
mbae26
closed
9 months ago
0
Would you please provide a citation bibtex?
#26
JacksonWuxs
closed
10 months ago
2
Added Falcon-7b-instruct
#25
mbae26
closed
10 months ago
0
Any arxiv paper or report?
#24
zhimin-z
closed
10 months ago
2
Update README.md
#23
mbae26
closed
10 months ago
0
Fix typos in README file
#22
pitmonticone
closed
10 months ago
0
Can we reproduce the leadersboard?
#21
amir-abdi
closed
11 months ago
1
Added Titan-express model
#20
mbae26
closed
11 months ago
0
Can you add a CITATION.cff for easy citation?
#19
sigjhl
closed
11 months ago
1
Update leaderboard
#18
mbae26
closed
11 months ago
1
Update README.md
#17
mbae26
closed
11 months ago
0
Claude 2.1 Benchmark Missing
#16
lukestanley
closed
11 months ago
1
Update API details for Cohere, specifying which model we used
#15
mbae26
closed
11 months ago
0
Change in update date
#14
mbae26
closed
1 year ago
0
Updated Hallucination Evaluation Leaderboard and made a short comment…
#13
mbae26
closed
1 year ago
0
hide borders
#12
ofermend
closed
1 year ago
0
minor update
#11
ofermend
closed
1 year ago
0
added in memory of Simon
#10
ofermend
closed
1 year ago
0
Generation parameters
#9
qmdnls
closed
1 year ago
1
Google PaLM reference
#8
suddhasatwabhaumik
closed
11 months ago
2
Update README.md
#7
eltociear
closed
1 year ago
1
instance level metric outputs
#6
cabreraalex
opened
1 year ago
4
Google Palm API version?
#5
zizhaozhang
closed
1 year ago
1
how did you determine what is factually correct ?
#4
listaction
closed
1 year ago
2
GPT 4-Turbo
#3
orionsolidified
closed
1 year ago
1
Any date on releasing the training script for the model?
#2
deshwalmahesh
closed
1 year ago
2
Integrate with LiteLLM - Evaluate 100+LLMs, 92% faster
#1
ishaan-jaff
closed
11 months ago
1
Previous