simonw/llm-evals-plugin
Run evals using LLM
20 stars, 0 forks
issues
#10 Evaluating RAG outputs? (vikram-s-narayan, opened 6 months ago, 0 comments)
#9 Researching evaluations (irthomasthomas, opened 7 months ago, 0 comments)
#8 Run a subset of MMLU (simonw, opened 7 months ago, 9 comments)
#7 Ability to store evals in the database and run them from there too (simonw, opened 7 months ago, 1 comment)
#6 Design and document checks and plugin hook (simonw, opened 7 months ago, 5 comments)
#5 Design and implement better error reporting (simonw, opened 7 months ago, 3 comments)
#4 Design and implement parameterization mechanism (simonw, opened 7 months ago, 7 comments)
#3 Log results to database (simonw, opened 7 months ago, 0 comments)
#2 Ship alpha to PyPI (simonw, closed 7 months ago, 2 comments)
#1 Initial design (simonw, closed 7 months ago, 19 comments)