simonw llm-evals-plugin issues - Githubissues

simonw / llm-evals-plugin

Run evals using LLM

20 stars 0 forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

Evaluating RAG outputs?

#10 vikram-s-narayan opened 6 months ago
0
Researching evaluations

#9 irthomasthomas opened 7 months ago
0
Run a subset of MMLU

#8 simonw opened 7 months ago
9
Ability to store evals in the database and run them from there too

#7 simonw opened 7 months ago
1
Design and document checks and plugin hook

#6 simonw opened 7 months ago
5
Design and implement better error reporting

#5 simonw opened 7 months ago
3
Design and implement parameterization mechanism

#4 simonw opened 7 months ago
7
Log results to database

#3 simonw opened 7 months ago
0
Ship alpha to PyPI

#2 simonw closed 7 months ago
2
Initial design

#1 simonw closed 7 months ago
19