JuliaAI / DecisionTree.jl

Julia implementation of Decision Tree (CART) and Random Forest algorithms
Other
356 stars 102 forks source link

Support for DataFrame based data and model formulas #13

Open saosebastiao opened 10 years ago

saosebastiao commented 10 years ago

Hello, thanks for writing this. I've benchmarked its use against the default randomForest implementation in R and have found it to be amazingly fast.

I was hoping to be able to use this library with DataFrames, including the Model Formula format api. I know that DataFrames currently doesn't support categorical data columns, but I think it is planned to be integrated.

I can try to help contribute to this, but it would be nice if this project was merged into the JuliaStats project first (I prefer to contribute to projects that are explicitly community owned).

bensadeghi commented 10 years ago

Thanks for trying out and benchmarking the package. I've been meaning to add support for DataFrames and model formula syntax, but just haven't gotten around to it. As for merging with JuliaStats, I feel that perhaps I should write a new consistent API first (via wrappers). Let me look into it. Also, take a look at MachineLearning.jl and RandomForests.jl. Cheers.

cstjean commented 8 years ago

FWIW, I've been using DecisionTree.jl with DataFrames via ScikitLearn.DataFrameMapper and ScikitLearn.Pipeline. See here