hoteljuliet / SPEL

A DSL/EL designed for json transforms, configurable business logic, and appropriate for stream processing
Other
1 stars 1 forks source link

SPIKE: Clustering Algos #26

Open hoteljuliet opened 1 year ago

hoteljuliet commented 1 year ago

We should implement a number of them: DBSCAN, K-Means/++, others?

Apache Math has some built in:

https://commons.apache.org/proper/commons-math/javadocs/api-3.6.1/index.html?org/apache/commons/math3/ml/clustering/DBSCANClusterer.html

https://commons.apache.org/proper/commons-math/javadocs/api-3.6.1/index.html?org/apache/commons/math3/ml/clustering/DBSCANClusterer.html

hoteljuliet commented 1 year ago

If required, the statements can store a list/array of values as private members, since the entire step will exist as stateful data (either check-pointed or state-backend backed up).