xabol / date-a-scientist

0 stars 0 forks source link

One-hot Encoding #3

Open 15mikesmith opened 5 years ago

15mikesmith commented 5 years ago

Converting the non numerical features to numeric representation would have been a great opportunity to use one hot encoding. In short, by using a scale instead of One-hot Encoding the student may have inadvertently given some features(E.g. Age) imbalanced significance (I.e. 45 could be considered more significant than 15 solely based on how large the number is not the actually feature itself). I'm certain if the student had used this technique, the models accuracy would have improved.

Here's a link if the student is interested learning more: https://machinelearningmastery.com/why-one-hot-encode-data-in-machine-learning/

xabol commented 5 years ago

Thanks Mike, had not heard about one hot encoding and looked it up and it all makes sense. Thx