sramirez / spark-MDLP-discretization

Spark implementation of Fayyad's discretizer based on Minimum Description Length Principle (MDLP)
Apache License 2.0
44 stars 27 forks

Handling metadata #29

Closed: rauljosepalma closed this issue 7 years ago

rauljosepalma commented 7 years ago

It would be good if the ml version took the DataFrame metadata into account, so that nominal attributes are not discretized. It would also be good if the produced DataFrame contained metadata for the discretized attributes, mainly their values property.
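To make the request concrete, here is a minimal sketch of both halves using Spark's `org.apache.spark.ml.attribute` API. It is not this library's implementation. The object `MetadataSketch`, its two helpers, the `f$i` attribute names, the bin-label format, and the assumption that each feature's cut points arrive as an `Array[Double]` are all hypothetical.

```scala
import org.apache.spark.ml.attribute.{Attribute, AttributeGroup, NominalAttribute}
import org.apache.spark.sql.types.StructField

// Hypothetical helpers, for illustration only (not part of spark-MDLP-discretization).
object MetadataSketch {

  // Indices of the features that the input column's metadata does not mark as nominal.
  // Returns None when the vector column carries no per-attribute metadata,
  // in which case a caller could fall back to discretizing every feature.
  def continuousIndices(field: StructField): Option[Array[Int]] =
    AttributeGroup.fromStructField(field).attributes.map { attrs =>
      attrs.zipWithIndex.collect { case (attr, i) if !attr.isNominal => i }
    }

  // Describe each discretized feature as a nominal attribute whose `values`
  // are labels for its bins. Assumes cutPoints(i) holds the sorted cut points
  // of feature i, including the outer -Infinity / +Infinity bounds.
  def discretizedGroup(outputCol: String, cutPoints: Array[Array[Double]]): AttributeGroup = {
    val attrs: Array[Attribute] = cutPoints.zipWithIndex.map { case (cuts, i) =>
      // One label per pair of consecutive cut points; features with fewer
      // than two cut points simply get no labels.
      val labels = cuts.sliding(2).collect { case Array(lo, hi) => s"[$lo, $hi)" }.toArray
      NominalAttribute.defaultAttr.withName(s"f$i").withValues(labels)
    }
    new AttributeGroup(outputCol, attrs)
  }
}
```

The resulting group could then be attached to the output column with something like `df.withColumn(outputCol, col(outputCol).as(outputCol, group.toMetadata()))`, so that downstream stages see the discretized features as nominal.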