vectara/hallucination-leaderboard
Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents
https://vectara.com
Apache License 2.0
1.25k stars, 50 forks
issues
add results for olmo2 models
#81
Miaoranmmm
closed
4 hours ago
0
Reproducing HF model Summaries
#80
Noor-Nizar
opened
5 days ago
0
add results for claude-3.5-haiku
#79
Miaoranmmm
closed
3 weeks ago
0
add results for cohere aya-expanse 8b and 32b
#78
Miaoranmmm
closed
1 month ago
0
Claude 3.5 Sonnet New
#77
sushantnair
opened
1 month ago
2
add results for xai grok-beta
#76
Miaoranmmm
closed
1 month ago
0
update results for granite-3.0 2b and 8b
#75
Miaoranmmm
closed
1 month ago
0
update result for claude-3-5-sonnet-20241022
#74
Miaoranmmm
closed
1 month ago
0
Update README.md
#73
ofermend
closed
1 month ago
0
add results for gemini 002 models and remove experimental models
#72
Miaoranmmm
closed
2 months ago
0
add results for llama 3.2 models
#71
Miaoranmmm
closed
2 months ago
0
add results for Qwen2.5-Instruct 0.5B, 1.5B, 3B, 7B, 14B
#70
Miaoranmmm
closed
2 months ago
0
update results for gemini-1.5-pro and gemini-1.5-flash, add results for gemini-flash-experimental and gemini-pro-experimental
#69
Miaoranmmm
closed
2 months ago
0
add results for openai o1 models
#68
Miaoranmmm
closed
2 months ago
0
add results for deepseek-v2.5
#67
Miaoranmmm
closed
2 months ago
0
add results for Qwen2-VL-2B/7B-Instruct; minor fix for plot
#66
Miaoranmmm
closed
2 months ago
0
add results for the newest command-r and command-r-plus; remove results for older cohere models
#65
Miaoranmmm
closed
2 months ago
0
add minicheck as a fact-checking model
#64
Liyan06
closed
2 months ago
0
add results for AI21-Jamba-1.5-Mini, Phi-3.5-MoE-instruct, Mistral-Nemo-Instruct
#63
Miaoranmmm
closed
3 months ago
0
add results for phi-3.5-mini-instruct
#62
Miaoranmmm
closed
3 months ago
0
update results with HF HHEM2.1
#61
Miaoranmmm
closed
3 months ago
0
add script for plot
#60
Miaoranmmm
closed
3 months ago
2
add results for gemma-2-2b-it and Mistral-7B-Instruct-v0.3, remove description of Mistral-7B-Instruct-v0.1 and v0.2
#59
Miaoranmmm
closed
3 months ago
0
add claude-3-5-sonnet and add % for answer rate
#58
Miaoranmmm
closed
3 months ago
0
Update README
#57
ofermend
closed
3 months ago
0
correct HHEM name; add hallucination rate plot
#56
forrestbao
closed
4 months ago
0
Leaderboard with hhem2.1
#55
Miaoranmmm
closed
4 months ago
0
update results for Yi-1.5-34/9/6B-Chat, Qwen2-72B-Instruct, and GLM-4-9B-Chat
#54
Miaoranmmm
closed
4 months ago
0
Update README.md
#53
eltociear
closed
4 months ago
0
add results for llama-3.1-8/70/405B
#52
Miaoranmmm
closed
4 months ago
0
add results for gpt-4o-mini
#51
Miaoranmmm
closed
4 months ago
0
How can I use the HHEM model to evaluate my LLM after finetuning?
#50
zjq0455
opened
4 months ago
1
add results for google/gemma-2-9b-it
#49
Miaoranmmm
closed
5 months ago
0
add results for Claude-3.5-Sonnet
#48
Miaoranmmm
closed
5 months ago
0
Add results for Gemini 1.5 Flash
#47
Miaoranmmm
closed
6 months ago
0
Add results for GPT-4o
#46
Miaoranmmm
closed
6 months ago
0
Add results for apple/OpenELM-3B-Instruct
#45
Miaoranmmm
closed
7 months ago
0
minor updates on general information
#44
Miaoranmmm
closed
7 months ago
0
add results for snowflake-arctic-instruct
#43
Miaoranmmm
closed
7 months ago
0
added Miaoran to authors list
#42
ofermend
closed
7 months ago
0
Add results for Phi-3-mini-4k-instruct and Phi-3-mini-128k-instruct
#41
Miaoranmmm
closed
7 months ago
0
add results for gemini-1.5-pro
#40
Miaoranmmm
closed
7 months ago
0
add results for mistralai/Mixtral-8x22B-Instruct-v0.1
#39
Miaoranmmm
closed
7 months ago
0
add results for microsoft/WizardLM-2-8x22B and update google drive link
#38
Miaoranmmm
closed
7 months ago
0
update result for GPT-4-Turbo-2024-04-09
#37
Miaoranmmm
closed
7 months ago
0
add result for databricks/dbrx-instruct
#36
Miaoranmmm
closed
7 months ago
0
add cohere command-r-plus
#35
Miaoranmmm
closed
7 months ago
0
add results for gemma-1.1-7b-it and gemma-1.1-2b-it
#34
Miaoranmmm
closed
7 months ago
0
added mistral-7b-instruct-v0.2
#33
mbae26
closed
8 months ago
0
Added Intel Neural Chat 7B
#32
mbae26
closed
8 months ago
0