Closed nicoloboschi closed 3 weeks ago
i wonder if when running in deferred mode, this change would prevent trulens from seeing this as a failed evaluation and then would not re-run it.
i wonder if when running in deferred mode, this change would prevent trulens from seeing this as a failed evaluation and then would not re-run it.
It might be as side effect. Anyway all other feedback providers don't fail if the reason can't be found so I think it's better to align to the others' behaviour
Items to add to release announcement:
Currently if you use feedback
groundedness_measure_with_cot_reasons
and one of the hypothesis is returning no supporting evidence, the feedback is completely broken and not even the score is visible. This is due to the fact this feedback implies to always have a "reason" behind the score.Summary:
This PR fixes an issue in the
groundedness_measure_with_cot_reasons
feedback function to handle cases with no supporting evidence without breaking.Key points:
evaluate_hypothesis
function inProvider
class to handle missing 'reason' key gracefully.Generated with :heart: by ellipsis.dev