Closed lewtun closed 2 years ago
This PR fixes a bug in the way we determined which models were previously evaluated. With this fix, duplicate evaluations should be prevented for most configurations (we still have an edge case with metrics)
This PR fixes a bug in the way we determined which models were previously evaluated. With this fix, duplicate evaluations should be prevented for most configurations (we still have an edge case with metrics)