issues
search
Jellyfish042
/
uncheatable_eval
Evaluating LLMs with Dynamic Data
MIT License
66
stars
4
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
update
#8
Jellyfish042
closed
1 month ago
0
2024-08 data
#7
Jellyfish042
closed
1 month ago
0
Batch eval
#6
Jellyfish042
closed
2 months ago
0
add Evaluator and EvaluationConfig for better management
#5
Jellyfish042
closed
4 months ago
0
update
#4
Jellyfish042
closed
6 months ago
0
BBC_crawler does not work as expceted
#3
jijivski
opened
6 months ago
1
new readme
#2
Jellyfish042
closed
6 months ago
0
Potential skew due to different tokenizers or vocabularies
#1
melang982
closed
7 months ago
6