simonw / llm-evals-plugin

Run evals using LLM
19 stars 0 forks source link

Evaluating RAG outputs? #10

Open vikram-s-narayan opened 2 months ago

vikram-s-narayan commented 2 months ago

One of the major issues I've been running into is evaluating outputs when you have a document on which retrieval augmented generation is being performed ... especially when it's difficult to obtain baseline human responses.

Is this one of the intended use cases for this plugin?


One (potentially wild) thought: