Azure-Samples / rag-postgres-openai-python

A RAG app to ask questions about rows in a database table. Deployable on Azure Container Apps with PostgreSQL Flexible Server.
MIT License
263 stars 131 forks source link

Evaluate a high temperature (Do not merge) #120

Closed pamelafox closed 1 month ago

pamelafox commented 1 month ago

/evaluate

github-actions[bot] commented 1 month ago

Starting evaluation! Check the Actions tab for progress, or wait for a comment with the results.

github-actions[bot] commented 1 month ago

Evaluation results

metric stat baseline pr120
gpt_groundedness pass_rate 1.0 0.9
mean_rating 5.0 4.6
gpt_relevance pass_rate 1.0 0.4
mean_rating 5.0 2.8
answer_length mean 978.9 4223.4
latency mean 2.51 6.67
citation_match rate 1.0 1.0
num_questions total 10 10

Check the workflow run for more details.