huawei-noah / streamDM

Stream Data Mining Library for Spark Streaming
http://streamdm.noahlab.com.hk/
Apache License 2.0
492 stars 147 forks source link

StreamDM-97 #99

Closed ioanna-ki closed 6 years ago

ioanna-ki commented 6 years ago

Summary of the changes

Function train of StreamKM needs to be combined with the assign method, since data are lost between the calls. Function trainStreamKM updates the bucketManager with the incoming input, then runs kmeans.cluster and assigns the input to the nearest cluster's index. To call trainStreamKM "ClusteringTrainEvaluate" needs some changes as well.

Classes that are affected

StreamKM, ClusteringTrainEvaluate

Tests

Run with command: ./spark.sh "ClusteringTrainEvaluate -c (StreamKM) -s (SocketTextStreamReader)" 1> result_iris_streamKM.txt 2> log_iris_streamKM.log

hmgomes commented 6 years ago

Hi @ioanna-ki ,

Thanks for applying the requested changes, let's now wait for the other members' reviews

Best Regards,

Heitor