datumbox / datumbox-framework

Datumbox is an open-source Machine Learning framework written in Java which allows the rapid development of Machine Learning and Statistical applications.
http://www.datumbox.com/
Apache License 2.0

Cross Validation in Datumbox for parameter selection #13

Status: Closed (shoubhik closed this issue 8 years ago)

shoubhik commented 8 years ago

Does Datumbox support cross-validation internally to tune the parameters? I see that in this post (http://blog.datumbox.com/how-to-build-your-own-twitter-sentiment-analysis-tool/) you talk about 10-fold cross-validation. Do we need to do it on our own? I could not find any example of this. Currently, what is the best way to tune parameters in Datumbox?

datumbox commented 8 years ago

Yes, it is supported. All the Modeler classes of the framework provide the kFoldCrossValidation() method, which can be used to cross-validate their parameters.
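For readers unfamiliar with the technique, here is a minimal, framework-independent sketch of what k-fold cross-validation for parameter selection does. It does not use the Datumbox API: the class KFoldSketch, the FoldEvaluator callback, and every method name below are hypothetical and exist only for illustration. The records are split into k shuffled folds, each candidate parameter is scored k times (training on k-1 folds, scoring on the held-out fold), and the candidate with the highest mean score is kept.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Random;

// Illustrative only: NOT part of Datumbox. All names here are hypothetical.
public class KFoldSketch {

    /** Hypothetical callback: train with `parameter` on trainIdx, return a score on testIdx. */
    public interface FoldEvaluator<P> {
        double trainAndScore(List<Integer> trainIdx, List<Integer> testIdx, P parameter);
    }

    /** Splits the record indices 0..n-1 into k shuffled folds of nearly equal size. */
    public static List<List<Integer>> makeFolds(int n, int k, long seed) {
        if (k < 2 || k > n) {
            throw new IllegalArgumentException("k must be between 2 and n");
        }
        List<Integer> indices = new ArrayList<>();
        for (int i = 0; i < n; i++) {
            indices.add(i);
        }
        Collections.shuffle(indices, new Random(seed));

        List<List<Integer>> folds = new ArrayList<>();
        for (int f = 0; f < k; f++) {
            folds.add(new ArrayList<>());
        }
        // Deal the shuffled indices round-robin so fold sizes differ by at most one.
        for (int i = 0; i < n; i++) {
            folds.get(i % k).add(indices.get(i));
        }
        return folds;
    }

    /** Mean score of one parameter value over the k held-out folds. */
    public static <P> double crossValidate(int n, int k, long seed, P parameter,
                                           FoldEvaluator<P> evaluator) {
        List<List<Integer>> folds = makeFolds(n, k, seed);
        double total = 0.0;
        for (int held = 0; held < k; held++) {
            // Train on every fold except the held-out one.
            List<Integer> train = new ArrayList<>();
            for (int g = 0; g < k; g++) {
                if (g != held) {
                    train.addAll(folds.get(g));
                }
            }
            total += evaluator.trainAndScore(train, folds.get(held), parameter);
        }
        return total / k;
    }

    /** Returns the candidate with the highest mean cross-validated score. */
    public static <P> P selectBest(int n, int k, long seed, List<P> candidates,
                                   FoldEvaluator<P> evaluator) {
        P best = null;
        double bestScore = Double.NEGATIVE_INFINITY;
        for (P candidate : candidates) {
            // The same seed is reused so every candidate is scored on identical folds.
            double score = crossValidate(n, k, seed, candidate, evaluator);
            if (score > bestScore) {
                bestScore = score;
                best = candidate;
            }
        }
        return best;
    }
}
```

Calling selectBest with k = 10 would mirror the 10-fold setup mentioned in the question. In practice the FoldEvaluator body is where a real model would be trained and validated; how that maps onto the framework's own kFoldCrossValidation() method is not shown here.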

metsvisser commented 7 years ago

Could you please specify how to use this method? Which object should I call it on? It seems that classifier.kFoldCrossValidate(dataframe, ...) does not work. I am not a very experienced Java developer. There seems to be no other object in the TextClassification class that I can call it on.