issues
search
MLGroupJLU
/
LLM-eval-survey
The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".
https://arxiv.org/abs/2307.03109
1.44k
stars
91
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Add SciSafeEval: A Comprehensive Benchmark for Safety Alignment of Large Language Models in Scientific Tasks
#32
DavidLee528
opened
1 month ago
0
Add LLMBox
#31
huyiwen
opened
2 months ago
0
The leaderboard website is down...
#30
zhimin-z
closed
1 month ago
1
Paper Title Change
#29
nlee-208
closed
4 months ago
0
Is all the evaluation of LLM done by changeing the prompts?
#28
PlantPotatoOnMoon
opened
11 months ago
0
Add FinanceBench
#27
kennethleungty
closed
5 months ago
0
Can you add SpyGame to your survey?
#26
Skytliang
closed
5 months ago
1
Can you add our recent work to your survey?
#25
grayground
opened
1 year ago
1
Add references
#24
ChanLiang
opened
1 year ago
0
Can you add LRV-Instruction to Your update Arxiv Version?
#23
FuxiaoLiu
opened
1 year ago
1
Add CMB to your paper
#22
g-h-chen
opened
1 year ago
4
咨询下,LLM的数据污染检测(判断数据集是否训练见过)技术方向靠谱吗?有推荐论文吗?
#21
gongjunjin
opened
1 year ago
1
请教下,通过评测反馈LLM模型优化有哪些方向可以研究吗?即评测能反馈LLM优化建议
#20
gongjunjin
opened
1 year ago
1
Add link to the MMBench project
#19
tiansiyuan
closed
1 year ago
0
Add paper, ALIGNING AI WITH SHARED HUMAN VALUES
#18
tiansiyuan
closed
1 year ago
1
but it still lack the ability to perform Eng → X translation
#17
tiansiyuan
closed
1 year ago
1
An idea AGI evaluation -> An ideal AGI evaluation
#16
tiansiyuan
closed
1 year ago
1
Add Llama 2 as model evaluated?
#15
tiansiyuan
opened
1 year ago
3
Update README
#14
AIRobotZhang
closed
1 year ago
0
Add CHBias
#13
mengf1
closed
1 year ago
0
Update README.md
#12
sileod
closed
1 year ago
0
Suggestion for adding OpenCompass to survey
#11
gaotongxiao
closed
1 year ago
2
ARB Benchmark
#10
kennethleungty
closed
1 year ago
2
Suggestion about adding one evaluation paper about LLMs in science
#9
taichengguo
closed
1 year ago
2
Update README.md
#8
kennymckormick
closed
1 year ago
0
Add a new piece of work intersted in Arabic language to the translation section
#7
Aml-Hassan-Abd-El-hamid
closed
1 year ago
0
Add CARE-MI paper + Typographic enhancements
#6
kennethleungty
closed
1 year ago
0
Correct typographic issue in framework.png
#5
kennethleungty
closed
1 year ago
1
Typographic and formatting enhancements
#4
kennethleungty
closed
1 year ago
0
Add a new paper.
#3
Wangpeiyi9979
closed
1 year ago
1
added mindgames (tom)
#2
sileod
closed
1 year ago
0
update README.md
#1
tahmedge
closed
1 year ago
4