jainpranj / lending-club-data-analysis

Hierarchical clustering issues #26

Open nehalbhanushali opened 7 years ago

nehalbhanushali commented 7 years ago

Result of converting the labels to a DataFrame and concatenating it with the DataFrame being clustered:

image

nehalbhanushali commented 7 years ago

Instead, add the labels as a Series column on the DataFrame being clustered, indexed the same way:

subset['complete.label'] = pd.Series(complete.labels_, index=subset.index)
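
A minimal sketch of why the two approaches can differ, assuming `subset` has a non-default index (for example after filtering or sampling). The data and the `labels` array are hypothetical stand-ins for the real `subset` and `complete.labels_`; the actual cause in this issue may be different.

```python
import numpy as np
import pandas as pd

# Hypothetical subset with a non-default index, as left behind by filtering or sampling.
subset = pd.DataFrame({"loan_amnt": [5000, 12000, 8000]}, index=[10, 42, 97])
labels = np.array([0, 1, 0])  # stand-in for complete.labels_

# A fresh DataFrame of labels gets a new 0..n-1 index, so concat aligns
# on the union of both indexes and pads the gaps with NaN.
label_df = pd.DataFrame({"complete.label": labels})
misaligned = pd.concat([subset, label_df], axis=1)

# A Series built on subset.index keeps exactly one label per original row.
aligned = subset.copy()
aligned["complete.label"] = pd.Series(labels, index=subset.index)

print(misaligned)
print(aligned)
```

Assigning `complete.labels_` directly (without the Series wrapper) should also work, since a plain array is assigned by position.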

nehalbhanushali commented 7 years ago

Memory issue when clustering more than 10,000 rows.
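
A rough back-of-the-envelope sketch of why memory grows quickly, assuming the clustering stores a condensed pairwise distance matrix of 64-bit floats (n(n-1)/2 entries). The helper name `condensed_distance_bytes` is made up for illustration, and the real footprint depends on the implementation and linkage used.

```python
def condensed_distance_bytes(n_rows: int, bytes_per_value: int = 8) -> int:
    """Bytes needed for n(n-1)/2 pairwise distances (assumed float64)."""
    return n_rows * (n_rows - 1) // 2 * bytes_per_value


for n in (1_000, 10_000, 100_000):
    gib = condensed_distance_bytes(n) / 2**30
    print(f"{n:>7} rows: about {gib:.2f} GiB for distances alone")
```

Because the cost is quadratic in the number of rows, clustering a random sample of the data is one possible workaround.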