mazzzystar/TurtleBench
TurtleBench: Evaluating Top Language Models via Real-World Yes/No Puzzles.
https://arxiv.org/abs/2410.05262
Apache License 2.0
125 stars
9 forks
issues
Prompt for cases where the user's question has already revealed the solution (汤底)
#8
baoblei
opened
23 hours ago
3
refactor: restructure all code for TurtleBench project
#7
Duguce
closed
1 month ago
0
Jerryioo patch 1
#6
jerryioo
closed
2 months ago
0
Using LLMs as players to test which model asks the right questions.
#5
iamsk
closed
1 month ago
4
Cot prompts
#4
ax7e
opened
3 months ago
2
make it works and formatting the code.
#3
iamsk
closed
3 months ago
1
another implementation method
#2
waleyGithub
opened
3 months ago
1
Failed to reproduce the Chinese results; off by more than 5 points
#1
siftxxx
closed
3 months ago
4