knageswara78 / Python_Solutions

0 stars 0 forks source link

Removing the Missing value variables #5

Open knageswara78 opened 6 years ago

knageswara78 commented 6 years ago

Adding the variables having less than 20% missing values.

Check missing values in whole data

train.isnull().sum()

saving missing values in a variable

a = train.isnull().sum()/len(train)*100

saving column names in a variable

variables = train.columns variable = [ ] for i in range(0,12): if a[i]<=20: #setting the threshold as 20% variable.append(variables[i])