salvacorts / Spark-INFFC

🐘 Noise filtering method for Spark based on the fusion of classifiers and noise score metrics
Apache License 2.0
0 stars 0 forks source link

Ensemble Noise Classifier #2

Closed salvacorts closed 3 years ago

salvacorts commented 3 years ago

Create an ensemble of classifiers that can fit a dataset and can be used to predict whether new instances are a candidate to be noise or not.

Support the following two schemes:

Ideally, we would like to be able to insert and remove classifiers from the ensemble so we can easily test different combinations.

A classifier will report an example to be noisy if the prediction does not match the original label. Note that this ensemble should be used with the same dataset it was trained.

salvacorts commented 3 years ago

Closed with commit bad0511