Add the function to get prediction for each of the queries from golden_dataset. Prediction is used as comparison to retrieve metrics.
Usage example:
from evaluation import run_llm_for_eval, goldens
# set up orchestration, session, set uuid
eval_list = await run_llm_for_eval(goldens, orchestration, session, session_id)
Add the function to get prediction for each of the queries from golden_dataset. Prediction is used as comparison to retrieve metrics.
Usage example: