scikit-learn-contrib / boruta_py

Python implementations of the Boruta all-relevant feature selection method.
BSD 3-Clause "New" or "Revised" License
1.46k stars 252 forks source link

Categorical features #41

Closed Taakn closed 4 years ago

Taakn commented 6 years ago

Hello,

More of a question than an issue, but how do you handle categorical features?

Also seems like y needs to be an int? Otherwise if I leave it as float I am getting an error.

Thanks!

SymbolicSquared commented 5 years ago

For categorical features you will need to vectorize/dummy code, just like you would if you plan to use any classifier in sklearn. I suggest pandas get_dummies method.

ThomasBury commented 4 years ago

Hi,

regarding categorical:

danielhomola commented 4 years ago

Submit a PR with tests and I'll have a look.