issues
search
huggingface
/
evaluation-guidebook
Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard and designing lighteval!
Other
582
stars
35
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Update links after .md files renaming
#21
martinscooper
opened
2 hours ago
0
Fix typo
#20
martinscooper
closed
4 hours ago
2
Add license in readme
#19
clefourrier
closed
3 days ago
0
Add license
#18
haesleinhuepf
closed
3 days ago
2
Reward Model based Evaluations
#17
sanderland
closed
2 days ago
2
Fix typos detected by `codespell`
#16
tancnle
closed
6 days ago
0
docs: update README.md
#15
eltociear
closed
6 days ago
0
[TOPIC] How to design a good benchmark depending on your eval goals
#14
clefourrier
opened
6 days ago
0
[Doc] Update docs
#13
Imss27
closed
6 days ago
0
Update README.md
#12
clefourrier
closed
1 week ago
0
Update Troubleshooting inference.md
#11
clefourrier
closed
1 week ago
0
Update Tips and tricks.md
#10
clefourrier
closed
1 week ago
0
Update README.md
#9
clefourrier
closed
1 week ago
0
Update About evaluation.md
#8
clefourrier
closed
1 week ago
0
Update Troubleshooting inference.md
#7
NathanHB
closed
1 week ago
0
Update Tips and tricks.md
#6
NathanHB
closed
1 week ago
0
Update Designing your evaluation prompt.md
#5
NathanHB
closed
1 week ago
0
Update Basics.md
#4
NathanHB
closed
1 week ago
0
Update Designing your automatic evaluation.md
#3
NathanHB
closed
1 week ago
0
Update Basics.md
#2
NathanHB
closed
1 week ago
0
Update Basics.md
#1
NathanHB
closed
1 week ago
0