Description of changes:
This PR makes the following changes:
Update GetModelResponse and all three summarization accuracy metric transforms to support multiple input keys
Update TransformPipeline.execute to materialize intermediate datasets. I observed that not materializing them degrades the performance of SummarizationAccuracy evaluations; with this change, SummarizationAccuracy performs on par with the old code.
Refactor the summarization accuracy metrics to use a parent ABC, mimicking the approach taken with the semantic perturbation transforms.
Fix bug in semantic_perturbations.py where I didn't unpack *args and **kwargs. Updated the unit test to catch this bug.
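For reviewers, here is a minimal sketch of the kind of `*args`/`**kwargs` unpacking bug described above. It is not the actual code from semantic_perturbations.py: the class names `RecordingTransform`, `BuggyChild`, and `FixedChild` and the argument values are hypothetical, and it assumes the bug took the common form of forwarding the packed tuple and dict to the parent instead of unpacking them.

```python
from typing import Any, Dict, Tuple


class RecordingTransform:
    """Hypothetical parent class that simply records its init arguments."""

    def __init__(self, *args: Any, **kwargs: Any) -> None:
        self.args: Tuple[Any, ...] = args
        self.kwargs: Dict[str, Any] = kwargs


class BuggyChild(RecordingTransform):
    def __init__(self, *args: Any, **kwargs: Any) -> None:
        # Bug: the packed tuple and dict are forwarded as two positional
        # arguments, so the parent sees no keyword arguments at all.
        super().__init__(args, kwargs)


class FixedChild(RecordingTransform):
    def __init__(self, *args: Any, **kwargs: Any) -> None:
        # Fix: unpack so the parent receives the original call signature.
        super().__init__(*args, **kwargs)


# Hypothetical argument values, chosen only for illustration.
buggy = BuggyChild("model_input", num_perturbations=3)
fixed = FixedChild("model_input", num_perturbations=3)

print(buggy.args, buggy.kwargs)
print(fixed.args, fixed.kwargs)
```

A unit test that only checks construction succeeds would miss this, since both classes instantiate without error; comparing the recorded arguments against the ones passed in is what distinguishes them.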
By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.