cncf / devstats.archive

📈CNCF-created tool for analyzing and graphing developer contributions
https://devstats.cncf.io/
Apache License 2.0
444 stars 147 forks source link

[question] organization renaming #351

Closed tormath1 closed 2 years ago

tormath1 commented 2 years ago

Hi,

Thanks for this project - by curiosity I successfully set up a test instance to fetch data from flatcar-linux github organization (https://github.com/flatcar-linux/).

Repo in this organization moved from flatcar-linux -> kinvolk -> flatcar-linux. We can still see the redirects:

$ curl -I https://github.com/kinvolk/coreos-overlay
...
location: https://github.com/flatcar-linux/coreos-overlay

On the devstat instance, there is a gap during the kinvolk period because the projects.yml is defined for flatcar-linux org. What should be the way to get data from both organization ?

I tried to edit the companies.yml, thinking that kinvolk/* activities would be ingested by flatcar-linux but no more luck.

Thanks!

lukaszgryglicki commented 2 years ago

DevStats doesn't do any redirects, I parses files from GHA archives. To track org and repo you can specify in projects.yaml: command_line like: a,b/c etc. See CNCF's projects projects.yaml file for details, also see per any project project-name/psql.sh.

tormath1 commented 2 years ago

Ouh nice, it seems to do the job, thanks. Just for my knowledge: command_line and the value in psql.sh - for example cilium,noironetworks/cilium-net in: https://github.com/cncf/devstats/blob/4cdfd0572055a094cb935692e660e6f4e780ac13/cilium/psql.sh#L16

Are the same things ?

lukaszgryglicki commented 2 years ago

Almost the same: Inproject-name/psql.sh you define "historical" command - that is used for 1st sync - so you also list old names/regexps to get repos/orgs In projects.yaml you define current command, so you can skip anything that is already historical now.

tormath1 commented 2 years ago

Ok perfect. Closing this issue, thanks for your support. :star:

lukaszgryglicki commented 2 years ago

Also note that you can specify regexps - grep for regexp in projects.yaml and */psql.sh.