This PR ports the code to use the promptflow-evals SDK for the evaluation functionality, as the evaluate functionality is being deprecated in azure-ai-generative. The Q&A generation is still in azure-ai-generative for now.
Some user-facing changes:
Renamed custom metrics to "mygroundedness", "myrelevance", "mycoherence" to make it clear they're not the built-in metrics.
Purpose
This PR ports the code to use the promptflow-evals SDK for the evaluation functionality, as the evaluate functionality is being deprecated in azure-ai-generative. The Q&A generation is still in azure-ai-generative for now.
Some user-facing changes:
Does this introduce a breaking change?
Pull Request Type
What kind of change does this Pull Request introduce?
How to Test