Keerthi4308 / Reddit-Assignment

0 stars 1 forks source link

Churn #2

Open Keerthi4308 opened 5 years ago

Keerthi4308 commented 5 years ago

1)Divide the dataset for each of 14 days

Calculate churn, etc over four 14-day periods (14, 28, 42, 56) days between 2015 January to 2015 February.To make item 2 easier, professor split up RC_2015-01.bz2 and RC_2015-02.bz2 into chunks (of 1,000,000 lines) and put these in /l/research/social-media-mining/public/reddit-comments-2015-split .

2) Churn rate formula -- plz update @Lrschaede

[No.of not active user/Total no.of active users]

Lrschaede commented 5 years ago

sorry this is so late. I'm just now seeing that you asked me for it. What I had written down from class is: (Number of active users at the start + num of active users at the end)/ 2