klangner / matrobot

GAE application for presenting GitHub repository activity
www.matrobot.com

Measuring active contributors #3

Closed aronlindberg closed 11 years ago

aronlindberg commented 11 years ago

It would be very useful to be able to see the distribution of committers in every period. How many people committed in a certain period? How are the commits distributed between these people?
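The two questions above can be sketched in a few lines. This is only an illustration, not matrobot's actual code: it assumes commits are available as hypothetical `(author, date)` pairs with ISO dates, and it treats a calendar month as the period.

```python
from collections import Counter, defaultdict


def committers_per_period(commits):
    """Group commits by month (the 'YYYY-MM' prefix of an ISO date).

    `commits` is an iterable of (author, 'YYYY-MM-DD') tuples; this input
    shape is an assumption made for the sketch.
    """
    per_period = defaultdict(Counter)
    for author, date in commits:
        per_period[date[:7]][author] += 1
    return per_period


def summarize(per_period):
    """For each period, report committer count, commit count, and each author's share."""
    summary = {}
    for period, counts in sorted(per_period.items()):
        total = sum(counts.values())
        summary[period] = {
            "committers": len(counts),
            "commits": total,
            "shares": {a: n / total for a, n in counts.most_common()},
        }
    return summary


# Hypothetical sample data, for illustration only.
sample = [
    ("alice", "2012-10-03"),
    ("alice", "2012-10-17"),
    ("bob", "2012-10-20"),
    ("alice", "2012-11-02"),
]
result = summarize(committers_per_period(sample))
for period, stats in result.items():
    print(period, stats["committers"], stats["commits"], stats["shares"])
```

The `committers` value answers how many people committed in a period, and `shares` shows how the commits are split between them.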

klangner commented 11 years ago

I think I'll first prepare a visualization of people's activity (similar to the one for repositories). Then I'll try to show connected information, like people's activity on a specific repository.

aronlindberg commented 11 years ago

Sounds reasonable!

klangner commented 11 years ago

Since it looks like there is a problem with the archived data, I'll hold off on adding information about user activity. But for testing purposes I've uploaded the number of repository committers for the last 3 months.

aronlindberg commented 11 years ago

Makes sense. However, the problems with the data might be persistent, so we might have to accept the data we can get.

klangner commented 11 years ago

Yes, you are right. Maybe we will have to accept it, but even then I would like to know how big the problem with the data is. So I'm thinking about writing a crawler to get data directly from GitHub and compare it to githubarchive. The only problem is that GitHub only exposes activity for the last month or two, so I can do it only for recent months. But better this than nothing. If the problem affects only 2-3% of the data, then that's OK.
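The comparison could look roughly like the sketch below. It assumes both sources can be reduced to lists of event identifiers for the same time window; the function name and the sample IDs are hypothetical and not taken from either data source.

```python
def missing_rate(crawled_ids, archived_ids):
    """Return the fraction of crawled events that are absent from the archive.

    Both arguments are iterables of event identifiers (an assumption of this
    sketch). Returns 0.0 when nothing was crawled.
    """
    crawled = set(crawled_ids)
    if not crawled:
        return 0.0
    missing = crawled - set(archived_ids)
    return len(missing) / len(crawled)


# Hypothetical identifiers, for illustration only.
crawled = ["e1", "e2", "e3", "e4"]
archived = ["e1", "e2", "e4", "e9"]
rate = missing_rate(crawled, archived)
print(f"{rate:.1%} of crawled events are missing from the archive")
```

A rate in the 2-3% range would then be the "OK" case; the threshold itself is a judgment call.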

klangner commented 11 years ago

There is a new chart with developer activity for each month. Data is being added from 2012-11 backwards. I'll close this issue now. More detailed information about user activity will be provided by the github-analysis project, and issues have already been created there for this functionality.

If you think something should still be added to this ticket, just reopen it so we can discuss it.