Diyago / kaggle-malware

https://www.kaggle.com/c/microsoft-malware-prediction/
1 stars 0 forks source link

Fast computation of count features #1

Open Diyago opened 5 years ago

Diyago commented 5 years ago

сomputation of count features train[c].map(train[c].value_counts())

Aggregation cols:

def add_num_feats(df, numerical_cols):
    gr = df.groupby('id')
    for col in numerical_cols:
        for agg in ['sum', 'mean', 'count']:
            df[col+'_'+agg] = gr[col].transform(agg)
    return df