epam / OSCI

Open Source Contributor Index
https://opensourceindex.io/
GNU General Public License v3.0
161 stars 99 forks source link

Extension of analytics scope (Add licenses and programming languages) #64

Closed vlad-isayko closed 3 years ago

vlad-isayko commented 3 years ago

Plans

We plan to expand the scope of research.

We want to add two new reports:

  1. OSCI_Languages_YTD: report on the number of the company commits in the programming language since the beginning of the year.
  2. OSCI_Licenses_YTD: report on the number of the company commits in the repository with a license since the beginning of the year.

TODO

OSCI Languages YTD

  1. create transformation function, which gets push events commits as input and returns the amount of commits report grouped by company and language;
  2. create spark job;
  3. create cli command for this job;
  4. add job to daily-osci-rankings cli command.
Example output: company language commits
Google python 50
Google go 30
Microsoft typescript 40
Microsoft powershell 20

OSCI Licenses YTD

  1. create transformation function, which gets push events commits as input and returns the amount of commits report grouped by company and license
  2. create spark job
  3. create cli command for this job
  4. add job to daily-osci-rankings cli command
Example output: company license commits
Google apache-2.0 50
Google mit 30
Microsoft gpl-3.0 40
Microsoft lgpl-2.1 20