huawei-noah / streamDM

Stream Data Mining Library for Spark Streaming
http://streamdm.noahlab.com.hk/
Apache License 2.0

StreamDM-101: Update to Spark 2.3.2 #103

Closed hmgomes closed 5 years ago

hmgomes commented 5 years ago

Summary of the changes

The changes in this pull request update streamDM to the current Spark version, i.e., Spark 2.3.2. It should also work with Spark 2.3.1 and Spark 2.3.

Tests

This is a sanity check to verify that the code works with the latest version. The test uses electNormNew.arff, which is in the data directory, so there is no need to download a dataset to run the experiment.

  1. Execute EvaluatePrequential using electNormNew.arff

Run the command:

./spark.sh "200 EvaluatePrequential -l (trees.HoeffdingTree -l 0 -t 0.05 -g 200 -o) -s (FileReader -f ../data/electNormNew.arff -k 1000 -d 10 -i 10000) -e (BasicClassificationEvaluator -c -m) -h" 1> result_ELEC.txt 2> log_ELEC.log

Expected output: 10 rows of statistics in result_ELEC.txt
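
The row count can be checked with a short script. This is only a sketch: it assumes each row of statistics is one non-empty line in result_ELEC.txt and that the file has no extra header line (if it does, adjust the expected count). The helper name count_rows is hypothetical and not part of streamDM.

```shell
#!/bin/sh
# Hypothetical helper (not part of streamDM): count the non-empty lines in a file.
count_rows() {
    grep -c . "$1"
}

# Assumption: stdout of the run above was redirected to result_ELEC.txt
# and each statistics row occupies exactly one non-empty line.
result_file="result_ELEC.txt"
expected=10

if [ -f "$result_file" ]; then
    actual=$(count_rows "$result_file")
    if [ "$actual" -eq "$expected" ]; then
        echo "OK: found $actual rows"
    else
        echo "Unexpected row count: $actual (expected $expected)"
    fi
else
    echo "$result_file not found; run the command above first"
fi
```

If the count differs, log_ELEC.log (the redirected stderr) is the first place to look.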