issues
search
EQ-bench
/
EQ-Bench
A benchmark for emotional intelligence in large language models
MIT License
180
stars
13
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Request to evaluate the new O1 models by OpenAI (O1-preview and O1-mini)
#36
Belzedar94
closed
1 week ago
1
[Request] Creative Writing Benchmark
#35
Abdulhanan535
closed
6 days ago
1
New Command R 08-2024 and Command R+ 08-2024 models
#34
jukofyork
closed
6 days ago
1
Paper on creative writing benchmark
#33
AriMKatz
closed
1 month ago
2
Trying to get to the bottom of why `Qwen1.5-110B-Chat` scores so much higher than the `command-r` models
#32
jukofyork
closed
1 month ago
1
EQ-Bench localized to Polish
#31
djstrong
opened
2 months ago
3
Fix GPU cleanup
#30
djstrong
closed
2 months ago
0
Offload to the cpu
#29
djstrong
closed
2 months ago
5
Benchmark Failed
#28
djstrong
closed
3 months ago
7
+utf8
#27
CrispStrobe
closed
3 months ago
1
correcting a few labels
#26
CrispStrobe
closed
3 months ago
1
Error in calculating revise answer score
#25
impact-rm
closed
3 months ago
2
Contributing to OpenCompass
#24
bittersweet1999
opened
4 months ago
3
Contributing with other judges
#23
Krisseck
opened
4 months ago
3
Passing in model_kwargs
#22
derpyplops
closed
5 months ago
1
default judge model setting for the leaderboard
#21
gyin94
closed
5 months ago
1
model test request
#20
dnhkng
closed
5 months ago
26
'BitsAndBytesConfig' object has no attribute 'get_loading_attributes'
#19
Abdullah-kwl
closed
6 months ago
2
Start llama.cpp server
#18
dnhkng
opened
6 months ago
2
Add llama server inference
#17
dnhkng
closed
6 months ago
1
Backend changes scores significantly
#16
dnhkng
closed
5 months ago
50
+install windows
#15
CrispStrobe
opened
6 months ago
6
Input length of input_ids is 1211, but `max_length` is set to 1000. This can lead to unexpected behavior. You should consider increasing `max_length` or, better yet, setting `max_new_tokens`. Benchmark run failed
#14
Abdullah-kwl
closed
6 months ago
2
-verbose
#13
CrispStrobe
closed
6 months ago
0
added support for additional language (de)
#12
CrispStrobe
closed
6 months ago
6
handling model tags for ollama
#11
CrispStrobe
closed
7 months ago
0
handling model tags for ollama
#10
CrispStrobe
closed
7 months ago
1
include gemma.cpp
#9
CrispStrobe
closed
7 months ago
1
added gemma.cpp
#8
CrispStrobe
closed
7 months ago
0
adding support for alternative openai api: config.cfg
#7
CrispStrobe
closed
7 months ago
0
add support for alternative openai api
#6
CrispStrobe
closed
7 months ago
1
Add some of the new 100B+ models to the leaderboard
#5
cosmojg
closed
7 months ago
4
v2 outputs
#4
gblazex
closed
7 months ago
12
Support for Seq2Seq LMs
#3
CarlsVoca
closed
7 months ago
2
Add Claude
#2
tekumara
closed
7 months ago
1
The prompt to generate the dialogue.
#1
GorgeousWang
closed
9 months ago
2