issues
search
mazzzystar
/
TurtleBench
Benchmark for LLM Reasoning & Understanding with Challenging Tasks from Real Users.
https://mazzzystar.github.io/2024/08/09/turtle-benchmark-zh/
Apache License 2.0
106
stars
8
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
refactor: restructure all code for TurtleBench project
#7
Duguce
closed
2 days ago
0
Jerryioo patch 1
#6
jerryioo
closed
3 weeks ago
0
Using LLMs as players to test which model asks the right questions.
#5
iamsk
closed
2 days ago
4
Cot prompts
#4
ax7e
opened
1 month ago
2
make it works and formatting the code.
#3
iamsk
closed
1 month ago
1
another implementation method
#2
waleyGithub
opened
1 month ago
1
chinese复现失败,差5个点以上
#1
siftxxx
closed
1 month ago
4