vectara hallucination-leaderboard issues

vectara / hallucination-leaderboard

Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents

https://vectara.com

Apache License 2.0

1.25k stars 50 forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

Removed cohere command r 8bitq

#31 mbae26 closed 8 months ago
0
Added cohere's command r model

#30 mbae26 closed 8 months ago
0
Update Claude3 opus and sonnet

#29 mbae26 closed 9 months ago
0
Fix link to csv in README.

#28 gueraf closed 9 months ago
0
Added gemma-7b-it

#27 mbae26 closed 9 months ago
0
Would you please provide a citation bibtex?

#26 JacksonWuxs closed 10 months ago
2
Added Falcon-7b-instruct

#25 mbae26 closed 10 months ago
0
Any arxiv paper or report?

#24 zhimin-z closed 10 months ago
2
Update README.md

#23 mbae26 closed 10 months ago
0
Fix typos in README file

#22 pitmonticone closed 10 months ago
0
Can we reproduce the leadersboard?

#21 amir-abdi closed 11 months ago
1
Added Titan-express model

#20 mbae26 closed 11 months ago
0
Can you add a CITATION.cff for easy citation?

#19 sigjhl closed 11 months ago
1
Update leaderboard

#18 mbae26 closed 11 months ago
1
Update README.md

#17 mbae26 closed 11 months ago
0
Claude 2.1 Benchmark Missing

#16 lukestanley closed 11 months ago
1
Update API details for Cohere, specifying which model we used

#15 mbae26 closed 11 months ago
0
Change in update date

#14 mbae26 closed 1 year ago
0
Updated Hallucination Evaluation Leaderboard and made a short comment…

#13 mbae26 closed 1 year ago
0
hide borders

#12 ofermend closed 1 year ago
0
minor update

#11 ofermend closed 1 year ago
0
added in memory of Simon

#10 ofermend closed 1 year ago
0
Generation parameters

#9 qmdnls closed 1 year ago
1
Google PaLM reference

#8 suddhasatwabhaumik closed 11 months ago
2
Update README.md

#7 eltociear closed 1 year ago
1
instance level metric outputs

#6 cabreraalex opened 1 year ago
4
Google Palm API version?

#5 zizhaozhang closed 1 year ago
1
how did you determine what is factually correct ?

#4 listaction closed 1 year ago
2
GPT 4-Turbo

#3 orionsolidified closed 1 year ago
1
Any date on releasing the training script for the model?

#2 deshwalmahesh closed 1 year ago
2
Integrate with LiteLLM - Evaluate 100+LLMs, 92% faster

#1 ishaan-jaff closed 11 months ago
1