imouzon / dmc2015

O we will fight, fight, fight for Iowa State
0 stars 4 forks source link

Category statistics #6

Open mingsnu opened 9 years ago

mingsnu commented 9 years ago

I have been studying the relationship between category and couponUsed. Here is my findings: https://github.com/imouzon/dmc2015/blob/master/weicheng/categoryExploration.md

The result shows that the coupon usage rate is different among different categories. So I think it maybe necessary to add single category columns. (The existing category columns are combined versions)

Single category (Already done)

For i = 1,2,...,31 nCategory1[i] = how many times was this category i seen in column 1 nCategory2[i] = how many times was this category i seen in column 2 nCategory3[i] = how many times was this category i seen in column 3 propCategory1[i] = what proportion of times was this category seen in column 1 propCategory2[i] = what proportion of times was this category seen in column 2 propCategory3[i] = what proportion of times was this category seen in column 3

But it seems that we have to add too many columns... So I didn't export it to the feature matrix yet, expecting for better ideas.