Some of the tests rely on hard-coded values, like the ones that failed in #127 due to scikit-learn's iris dataset update. They could fail again if, for instance, we used another initialization or another optimizer for some algorithm, even though the algorithm would still be valid. These tests could still be useful as benchmarking tasks, to ensure we keep reaching a good score on some basic tasks, but we should probably rely more on toy examples and on testing properties of the solution rather than hard-coded values, so that the tests pass no matter the initialization or the optimization procedure.
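For illustration, here is a minimal sketch of what such a property-based test could look like, assuming an estimator with a scikit-learn-style `fit(X, y)`, a `transform(X)`, and a `get_mahalanobis_matrix()` accessor (these names are placeholders, not a reference to the actual test suite): check that the learned matrix is symmetric positive semidefinite and that k-NN accuracy on a well-separated toy problem clears a loose threshold, instead of asserting exact values.

```python
import numpy as np
from sklearn.datasets import make_blobs
from sklearn.neighbors import KNeighborsClassifier
from sklearn.model_selection import cross_val_score


def check_solution_properties(estimator, random_state=42):
    # Well-separated toy problem: any reasonable metric learner should do well
    # here, regardless of initialization or optimizer.
    X, y = make_blobs(n_samples=100, centers=3, cluster_std=1.0,
                      random_state=random_state)
    estimator.fit(X, y)

    # Property 1: the learned Mahalanobis matrix is symmetric PSD
    # (placeholder accessor, assumed to exist on the estimator).
    M = estimator.get_mahalanobis_matrix()
    assert np.allclose(M, M.T), "matrix should be symmetric"
    assert np.all(np.linalg.eigvalsh(M) >= -1e-10), "matrix should be PSD"

    # Property 2: a loose performance bound on the toy task
    # instead of a hard-coded score.
    X_t = estimator.transform(X)
    score = cross_val_score(KNeighborsClassifier(n_neighbors=3),
                            X_t, y, cv=3).mean()
    assert score > 0.9, "should classify a trivially separable problem well"
```

Exact values could then be kept aside as benchmarks rather than as assertions that break on unrelated upstream changes.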
Yes, I've wanted to fix this issue with the test suite since gh-51. It'll be difficult to get the right test coverage without over-specifying results, but any progress on this is welcome.