fernandocastor closed this issue 11 years ago
Do we have a precise definition of what an active project on GitHub is? There are many ways to measure this, but have we already discussed the thresholds and which measures we are going to use?
Nope. What would you suggest?
@fernandocastor I think an active project is one that received a push within a certain amount of time. I can't really define that amount, but I would consider 1 year a good one.
We can use the same definition as Richard Sands, which says: an active project is a project that had at least one commit and at least 2 committers in the last 12 months.
:+1: and 12 months already seems like a lot, very tolerant. Let's stick with that!
I agree on that. I'll start coding. Thanks everyone for the help
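For reference, the definition agreed on above (at least one commit and at least 2 committers in the last 12 months) could be sketched roughly as below. This is only an illustration: `CommitInfo` and `ActiveProjectCheck` are hypothetical names, not classes from Groundhog, and the sketch assumes the commits of a project have already been fetched.

```java
import java.time.Instant;
import java.time.ZoneOffset;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical commit holder for this sketch; not Groundhog's actual commit class.
final class CommitInfo {
    final String committer;
    final Instant date;

    CommitInfo(String committer, Instant date) {
        this.committer = committer;
        this.date = date;
    }
}

final class ActiveProjectCheck {
    // A project counts as active if it had at least one commit and at least
    // two distinct committers in the 12 months before `now`.
    static boolean isActive(List<CommitInfo> commits, Instant now) {
        Instant cutoff = now.atZone(ZoneOffset.UTC).minusMonths(12).toInstant();
        Set<String> recentCommitters = new HashSet<>();
        for (CommitInfo commit : commits) {
            boolean inWindow = !commit.date.isBefore(cutoff) && !commit.date.isAfter(now);
            if (inWindow) {
                recentCommitters.add(commit.committer);
            }
        }
        // Two distinct committers in the window already imply at least one commit.
        return recentCommitters.size() >= 2;
    }
}
```

Passing `now` in as a parameter, instead of calling `Instant.now()` inside the method, keeps the check repeatable in tests and lets the same code answer the question for a fixed year.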
I used to write my token in SearchGitHub.oauthToken, but now it is injected and I can't do this anymore. Where is the correct place to put my token?
I found a token on line 19 of HttpModule, but is it OK to modify it?
This token is mine. Go ahead and change it.
Thanks
@fernandocastor here is some code that answers the question from this issue. After you check it, I'll close this issue.
hmmmm?
I wrote some code that answers the question and asked Castor to check it.
Oh, I see. Is this it https://github.com/spgroup/groundhog/compare/5e37f7d00f...6bb7d4ff0f?
Yep. I'll put the '@test' back in the others after validation. I deleted them so that when 'SearchGitHubTest.java' was running it didn't have to run all the other tests.
How many?
Of the first 20 found, none were active. Again I had that problem with the response limit. I couldn't run more than 20.
@fernandocastor I think my quota was full that time. I ran it again and 3 of the 20 projects were active.
@fernandocastor Now with the token I found a very different number: 23% of the projects were active. Tonight I'll be able to run it with 2000 projects and see the results. @gustavopinto I removed the .gitignore and I think that resolved the problem.
:metal:
Even with the token I can't even run it with 500 projects. It ran for 559 seconds before one of the commit requests came back negative.
I did the calculations and my quota of requests is 580. I can't get more. Any ideas?
Let it run through the night, stopping whenever it gets a negative response and trying again after 60 minutes. If you keep this pace, by tomorrow morning you'll have analyzed 5000 projects.
Is Groundhog raising an exception, @pork9? What do you mean by negative requests?
@fernandocastor @gustavopinto I was getting the message for an empty repository. Once I got past that, I wrote some code to handle the exceeded request limit message and wait one hour. Now I'm just trying to get the answer.
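The "stop on a negative response, wait, try again" approach discussed above might look something like the sketch below. It is not the code that was actually committed: `RateLimitRetry`, `RateLimitExceededException`, and `Sleeper` are hypothetical names made up for this illustration, and how Groundhog really detects an exceeded request limit is left as an assumption.

```java
import java.time.Duration;
import java.util.function.Supplier;

final class RateLimitRetry {
    // Hypothetical exception standing in for "request limit exceeded".
    static final class RateLimitExceededException extends RuntimeException {
        RateLimitExceededException(String message) {
            super(message);
        }
    }

    // Abstraction over waiting, so the wait can be replaced in tests.
    interface Sleeper {
        void pause(Duration duration);
    }

    // Default sleeper that really blocks the current thread.
    static final Sleeper THREAD_SLEEP = duration -> {
        try {
            Thread.sleep(duration.toMillis());
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("Interrupted while waiting for the quota to reset", e);
        }
    };

    // Runs the request; when the limit is hit, waits and tries again,
    // giving up after maxAttempts attempts.
    static <T> T callWithRetry(Supplier<T> request, Duration wait, int maxAttempts, Sleeper sleeper) {
        if (maxAttempts < 1) {
            throw new IllegalArgumentException("maxAttempts must be at least 1");
        }
        for (int attempt = 1; ; attempt++) {
            try {
                return request.get();
            } catch (RateLimitExceededException e) {
                if (attempt >= maxAttempts) {
                    throw e; // out of attempts, let the caller decide
                }
                sleeper.pause(wait);
            }
        }
    }
}
```

A call such as `RateLimitRetry.callWithRetry(() -> fetchCommits(project), Duration.ofMinutes(60), 24, RateLimitRetry.THREAD_SLEEP)` would then keep an overnight run going, where `fetchCommits` is a placeholder for whatever request is being made.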
I have the answer: 31.38% of the Java projects, out of 5000 projects, were active in 2012.
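A percentage computed with doubles can print with a long tail of digits, so it may be worth formatting it before reporting. A small sketch, assuming the counts are plain integers: `ActiveShare` is a hypothetical helper, and the 1569 in the usage note below is only what 31.38% of 5000 works out to, not a count reported in this thread.

```java
import java.util.Locale;

final class ActiveShare {
    // Formats active/total as a percentage with two decimal places.
    static String percentActive(int active, int total) {
        if (total <= 0) {
            throw new IllegalArgumentException("total must be positive");
        }
        double percent = 100.0 * active / total;
        // Locale.ROOT keeps the decimal separator a dot regardless of the machine's locale.
        return String.format(Locale.ROOT, "%.2f%%", percent);
    }
}
```

With these assumed counts, `ActiveShare.percentActive(1569, 5000)` gives `31.38%`.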
Very cool. How long did it take to get information about these 5000 projects?
Getting the project info took about 2.5 hours, but including the commits it took about 4 hours.
We need to implement and test the features required to use Groundhog to answer the question in the title of the issue. We then have to use it to actually answer the question.