spgroup / groundhog

A framework for crawling GitHub projects and raw data and extracting metrics from them
http://spgroup.github.io/groundhog
GNU General Public License v2.0
15 stars, 10 forks

To provide an answer to the question "What are the five most used licenses?" #56

Open fernandocastor opened 11 years ago

fernandocastor commented 11 years ago

We need to implement and test the features required to use Groundhog to answer the question in the title of the issue. We then have to use it to actually answer the question.

gustavopinto commented 11 years ago

The license detection algorithm still needs improvement, but here are some initial results from 25 random Java projects.

{Eclipse=1, Apache=4, BSD=1, not-understandable-license=4, unlincesed=14, MIT=1}
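For reference, a minimal sketch of how a tally like the one above could be ranked to answer the "five most used" question. It is not Groundhog code: the `top_licenses` helper and the `NON_LICENSE_BUCKETS` set are hypothetical names introduced only for illustration, and treating the two non-license buckets as excludable is an assumption. The counts and keys are copied verbatim from the result above.

```python
from collections import Counter

# Counts copied from the result above; keys are kept exactly as reported.
license_counts = Counter({
    "Eclipse": 1,
    "Apache": 4,
    "BSD": 1,
    "not-understandable-license": 4,
    "unlincesed": 14,
    "MIT": 1,
})

# Assumption: these two buckets do not name an actual license,
# so a "most used licenses" ranking may want to leave them out.
NON_LICENSE_BUCKETS = frozenset({"not-understandable-license", "unlincesed"})


def top_licenses(counts, n=5, exclude=frozenset()):
    """Return up to n (label, count) pairs, most frequent first.

    Ties are broken alphabetically so the ranking is deterministic.
    """
    kept = [(name, c) for name, c in counts.items() if name not in exclude]
    return sorted(kept, key=lambda item: (-item[1], item[0]))[:n]


total = sum(license_counts.values())
for name, count in top_licenses(license_counts, exclude=NON_LICENSE_BUCKETS):
    # Share is computed over all sampled projects, licensed or not.
    print(f"{name}: {count} ({count / total:.0%})")
```

With the non-license buckets excluded, only four named licenses remain in this sample, so 25 projects are not yet enough to fill a top five.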

fernandocastor commented 11 years ago

Nice. This seems consistent with the previous results I've pointed to.

Still, the problem remains: how do we get around the limit on the number of requests?

rodrigoalvesvieira commented 11 years ago

@gustavopinto wow. Great! Next week I should be able to get back to work on Groundhog but this seems amazing!