rust-ml / linfa

A Rust machine learning framework.
Apache License 2.0

Ideas for testing #65

Open bytesnake opened 3 years ago

bytesnake commented 3 years ago

This issue collects ideas you can use to improve your testing. Writing tests can be very time-consuming, but a large number of tests is crucial for good coverage.

If you have any specific test idea for any algorithm in the linfa ecosystem, please add a comment below :tada:

Sauro98 commented 3 years ago

Would it be ok to write tests referencing the datasets in the dataset folder (like iris) to try and replicate scikit-learn's tests?

bytesnake commented 3 years ago

Good suggestion. I have created a PR, https://github.com/rust-ml/linfa/pull/72, which introduces linfa-datasets for this purpose.

bytesnake commented 3 years ago

There is now a small section at the end of the CONTRIBUTE file explaining how to use linfa-datasets.

Sauro98 commented 3 years ago

Thank you!

YuhanLiin commented 3 years ago

Can we have benchmarks that measure the algorithms' accuracy values rather than performance? For example, for clustering algorithms we can measure the sum of squared distances from the nearest centroid as a metric for accuracy.
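
To make the clustering metric concrete, here is a minimal sketch of the sum of squared distances to the nearest centroid, using only the standard library. The function names `squared_distance` and `inertia` are hypothetical and not part of the linfa API; the sketch assumes points and centroids are plain `Vec<f64>` rows of equal length.

```rust
/// Squared Euclidean distance between two points of equal dimension.
/// (Hypothetical helper, not a linfa function.)
fn squared_distance(a: &[f64], b: &[f64]) -> f64 {
    a.iter().zip(b.iter()).map(|(x, y)| (x - y).powi(2)).sum()
}

/// Sum over all points of the squared distance to the nearest centroid.
/// Lower is better for a fixed dataset and number of clusters.
/// With no centroids, each point contributes `f64::INFINITY`.
/// (Hypothetical helper, not a linfa function.)
fn inertia(points: &[Vec<f64>], centroids: &[Vec<f64>]) -> f64 {
    points
        .iter()
        .map(|p| {
            centroids
                .iter()
                .map(|c| squared_distance(p, c))
                // Keep the smallest distance, i.e. the nearest centroid.
                .fold(f64::INFINITY, f64::min)
        })
        .sum()
}
```

For instance, four points that each lie at distance 1 from their nearest centroid give a value of 4.0. A benchmark along these lines could assert that the value stays below a threshold chosen for a fixed dataset and seed, so that regressions in clustering quality are caught even when runtime is unchanged.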