julianconstantine / Sberbank

Code for Sberbank Kaggle competition

Figure out which columns should be binarized #1

Open julianconstantine opened 7 years ago

julianconstantine commented 7 years ago

When you call pandas's get_dummies() function (e.g. in my simple_xgb.py script), you need to pass the columns argument to specify which variables get binarized/turned into dummies. Otherwise, it will automatically encode every column with object or category dtype, which probably isn't the best idea with ~300 features.

Also make sure to set drop_first to True next time you edit this.
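
For reference, a minimal sketch of what the call could look like. The toy DataFrame and the dummy_cols list are hypothetical stand-ins, not the actual features or the final list for this issue; only get_dummies() and its columns and drop_first arguments come from pandas itself.

```python
import pandas as pd

# Toy frame; the column names are hypothetical stand-ins for the real features.
df = pd.DataFrame({
    "product_type": ["Investment", "OwnerOccupier", "Investment"],
    "ecology": ["good", "poor", "good"],
    "full_sq": [43, 34, 50],
})

# Explicit list of columns to binarize (an assumption, not the real list).
dummy_cols = ["product_type"]

# Only the listed columns are encoded; drop_first=True drops one level
# per encoded column so the dummies are not perfectly collinear.
encoded = pd.get_dummies(df, columns=dummy_cols, drop_first=True)

# For contrast: without `columns`, every string/categorical column is encoded.
auto = pd.get_dummies(df)

print(list(encoded.columns))
print(list(auto.columns))
```

With an explicit list, "ecology" is left untouched here, whereas the automatic call expands it into dummies as well.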

julianconstantine commented 7 years ago

Right now the list is the following: