ShifuML / shifu

An end-to-end machine learning and data mining framework on Hadoop
https://github.com/ShifuML/shifu/wiki
Apache License 2.0

Add 'minCategoryCount' in stats of Categorical Variable #724

Open zhangpengshan opened 4 years ago

zhangpengshan commented 4 years ago

Some categories of a categorical variable have very few instances. Add a 'minCategoryCount' setting in stats that users can configure: if a category's count is less than the configured minCategoryCount, that category is treated as the null category.
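
To make the requested behavior concrete, here is a minimal sketch in Python. It is not Shifu code: the function name `merge_rare_categories` and the parameter `min_category_count` are hypothetical, and it assumes that "null category" means the same bucket that already collects missing values (represented here as `None`).

```python
from collections import Counter
from typing import Dict, Hashable, Iterable, Optional


def merge_rare_categories(
    values: Iterable[Optional[Hashable]], min_category_count: int
) -> Dict[Optional[Hashable], int]:
    """Count categories, folding rare ones into the null (None) bucket.

    Hypothetical helper for illustration only; not part of Shifu's API.
    """
    raw_counts = Counter(values)
    merged: Counter = Counter()
    for category, count in raw_counts.items():
        # Assumption: missing values and rare categories share one null bucket.
        if category is None or count < min_category_count:
            merged[None] += count
        else:
            merged[category] = count
    return dict(merged)


sample = ["a", "a", "a", "b", "b", "c", None]
stats = merge_rare_categories(sample, min_category_count=2)
print(stats)
```

With a threshold of 2, category `c` (one instance) is folded into the null bucket together with the one missing value, while `a` and `b` keep their own counts. A threshold of 0 or 1 leaves every observed category untouched, which would be a natural default if the setting is meant to be opt-in.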