Azure-Samples / rag-postgres-openai-python

A RAG app to ask questions about rows in a database table. Deployable on Azure Container Apps with PostgreSQL Flexible Server.
MIT License
262 stars 131 forks

Fix evaluation workflow #126

Closed. pamelafox closed this pull request 1 month ago

pamelafox commented 1 month ago

Purpose

This pull request fixes the GitHub Actions evaluation workflow so that it correctly identifies and uses the pull request branch name when committing and pushing evaluation results. The key changes are a new step that gets the PR branch name, plus updated branch checkout and push commands.
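The workflow file itself is not shown on this page, so the following is only a rough Python sketch of the idea, not the actual workflow step. It assumes the branch name is read from the `head.ref` field of a pull request object as returned by the GitHub REST API, and that results are committed with ordinary git commands. The function names, the results path, and the commit message are hypothetical.

```python
import shlex


def pr_head_branch(pull_request: dict) -> str:
    """Return the head branch name of a pull request.

    Assumes `pull_request` is shaped like the GitHub REST API
    "get a pull request" response, where the branch is at head.ref.
    """
    return pull_request["head"]["ref"]


def commit_and_push_commands(branch: str, results_path: str) -> list[str]:
    """Build the git commands that check out the PR branch and push results.

    The commit message and the choice of commands are illustrative
    assumptions, not copied from the real workflow.
    """
    quoted_branch = shlex.quote(branch)  # guard against unusual branch names
    return [
        f"git fetch origin {quoted_branch}",
        f"git checkout {quoted_branch}",
        f"git add {shlex.quote(results_path)}",
        'git commit -m "Add evaluation results"',
        f"git push origin {quoted_branch}",
    ]


if __name__ == "__main__":
    # Hypothetical pull request payload and results directory.
    pr = {"head": {"ref": "fix-eval-workflow"}}
    for command in commit_and_push_commands(pr_head_branch(pr), "evals/results"):
        print(command)
```

Resolving the branch name once and reusing it for both checkout and push keeps the two commands from pointing at different refs, which appears to be the class of problem this change addresses.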

Improvements to workflow:

Does this introduce a breaking change?

When developers merge from main and run the server, azd up, or azd deploy, will this produce an error? If you're not sure, try it out on an old environment.

[ ] Yes
[ ] No

Type of change

[ ] Bugfix
[ ] Feature
[ ] Code style update (formatting, local variables)
[ ] Refactoring (no functional changes, no api changes)
[ ] Documentation content changes
[ ] Other... Please describe:

Code quality checklist

See CONTRIBUTING.md for more details.

pamelafox commented 1 month ago

/evaluate

github-actions[bot] commented 1 month ago

Starting evaluation! Check the Actions tab for progress, or wait for a comment with the results.

github-actions[bot] commented 1 month ago

Evaluation results

metric             stat         baseline  pr126
gpt_groundedness   pass_rate    1.0       1.0
gpt_groundedness   mean_rating  5.0       5.0
gpt_relevance      pass_rate    1.0       1.0
gpt_relevance      mean_rating  5.0       5.0
answer_length      mean         1017.6    1011.4
latency            mean         2.56      2.62
citations_matched  mean         0.73      0.73

Check the workflow run for more details.
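To make the comparison in the results above easier to read, here is a small Python sketch that computes the difference between the pr126 and baseline columns. The numbers are copied from the reported results; the `RESULTS` structure and the `deltas` helper are hypothetical and are not part of the repository's evaluation tooling.

```python
# (metric, stat) -> (baseline, pr126), copied from the evaluation results.
RESULTS = {
    ("gpt_groundedness", "pass_rate"): (1.0, 1.0),
    ("gpt_groundedness", "mean_rating"): (5.0, 5.0),
    ("gpt_relevance", "pass_rate"): (1.0, 1.0),
    ("gpt_relevance", "mean_rating"): (5.0, 5.0),
    ("answer_length", "mean"): (1017.6, 1011.4),
    ("latency", "mean"): (2.56, 2.62),
    ("citations_matched", "mean"): (0.73, 0.73),
}


def deltas(results: dict) -> dict:
    """Return pr126 minus baseline for every row, rounded to 2 decimals."""
    return {
        key: round(pr - baseline, 2)
        for key, (baseline, pr) in results.items()
    }


if __name__ == "__main__":
    for (metric, stat), delta in deltas(RESULTS).items():
        if delta != 0:
            # Only rows that differ between baseline and the PR are printed.
            print(f"{metric} {stat}: {delta:+}")
```

Under this reading, only answer_length and latency differ from the baseline, which is consistent with a change that touches the workflow rather than the application.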