coala / gci-leaders

A website showing Google Code-in information 🏆
https://gci-leaders.netlify.com
39 stars 45 forks source link

Detect most active GCI repo for each org #165

Open jayvdb opened 6 years ago

jayvdb commented 6 years ago

Like https://github.com/coala/gci-leaders/issues/164 ...

Im not sure how to get it using open data methodology , but lets try... ;)

li-boxuan commented 6 years ago

A possible approach:

Fetch all GCI tasks Fetch task description, one by one Use regex to match GitHub/GitLab repo in the task description

API needed is on https://developers.google.com/open-source/gci/resources/downloads/TaskAPISpec.pdf

andrewda commented 6 years ago

I'm not positive on this one, but I don't think you're able to get task descriptions from other orgs using that API? So that would work for getting our most active GCI repo, but not the most active repo for other orgs.

li-boxuan commented 6 years ago

I'm not positive on this one, but I don't think you're able to get task descriptions from other orgs using that API?

Oh really, I didn't know that. People can see tasks from any org on gci website, so I assumed we could get task descriptions using API.

andrewda commented 6 years ago

That's true, maybe that's the case. The API just it's very well documented and it didn't look like they gave you the organization_name field on most of the requests...

jayvdb commented 6 years ago

Ya the API only gives data about one org, so not useful.

There are some hacks around to log in as a student, using oauth, and then you can see all tasks from all orgs.

But all tasks from all orgs are are available to all mentors from any org, but only if you know the URL.

jayvdb commented 6 years ago

As a first attempt at this, we could just sort the repos in each github org.

The scraper can not access github dashboards for orgs that it is not a member of.

We could try to get waffle stats out of orgs which have enabled that, e.g. https://waffle.io/coala/coala-bears/metrics/throughput

Ahhhh ... https://github.com/coala/gci-leaders/issues/166 will give us a window into this information.