mi3nts / MintsML.jl

https://mi3nts.github.io/MintsML.jl/
MIT License
0 stars 0 forks source link

add small set of test datasets to be used for regression testing #64

Closed john-waczak closed 10 months ago

john-waczak commented 11 months ago

A simple regression dataset like dr. lary's nonlinear function from class as well as the iris dataset.

All of our big "curated" datasets can be added to OSN in a new folder, say "/ml-datasets"

john-waczak commented 11 months ago

e.g. Dr. Lary's test function:

$f(x,y) = x^5 + y^3 - x^4 - y^3$

john-waczak commented 11 months ago

iris dataset, mnist, etc...

john-waczak commented 11 months ago

colors dataset for unsupervised classification methods e.g. KNN, DBSCAN, SelfOrganizingMaps, GTM, etc...

john-waczak commented 11 months ago

We should add some test datasets with missing values, nans, infs, etc to test the pipeline