sramirez / spark-MDLP-discretization

Spark implementation of Fayyad's discretizer based on Minimum Description Length Principle (MDLP)
Apache License 2.0
44 stars 27 forks source link

Issue 19 min bin weight bb #20

Closed barrybecker4 closed 8 years ago

barrybecker4 commented 8 years ago

This pull request for #19 avoids there ever being buckets generated that contain a really small number of records relative to the total size of the dataset.