jeffheaton / t81_558_deep_learning

T81-558: Keras - Applications of Deep Neural Networks @Washington University in St. Louis
https://sites.wustl.edu/jeffheaton/t81-558/
Other
5.71k stars 3.04k forks source link

Not clear what you mean by "dummy variables" in assignment two. #72

Closed pyrofolium closed 4 years ago

pyrofolium commented 4 years ago

You mention that the final dataframe should have a bunch of columns but what data is located in these columns? Can you be more clear?

jeffheaton commented 4 years ago

Dummy variables are explained here: https://github.com/jeffheaton/t81_558_deep_learning/blob/master/t81_558_class_02_2_pandas_cat.ipynb

Basically, if you have a column that is textual, and may hold any of the distinct values "a", "b", "c", you expand it to 3 columns and encode as follows

a = 0,0,1 b = 0,1,0 c = 1,0,0