Closed Adamits closed 1 month ago
Currently, we only support evaluating with exact match accuracy. We should support more evaluation metrics and probably require a framework for how they should be added.
Closed in #184.
Currently, we only support evaluating with exact match accuracy. We should support more evaluation metrics and probably require a framework for how they should be added.