salvacorts / Spark-INFFC

🐘 Noise filtering method for Spark based on the fusion of classifiers and noise score metrics
Apache License 2.0
0 stars 0 forks source link

Choose Classifiers #1

Closed salvacorts closed 2 years ago

salvacorts commented 3 years ago

The original paper uses the following classifiers:

Unfortunately, the ML lib does not have those methods, so we will need to search for an alrenative.

Ideally, we want to use classifiers that are robust to noise and complement each other.

The ML Lib API documentation can be fount here: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/ml/classification/index.html

Further documentation including examples can be found here: https://spark.apache.org/docs/latest/ml-classification-regression.html

salvacorts commented 3 years ago

We want to have a balance between speed and accuracy/resilience to noise.

Other classifiers we may try are: