Analysis of Citation File Format files on GitHub.
This is an SSI Collaborations Workshop 2022 hack day project π.
. (root)/ You are here.
ββ code/
β ββ data-retrieval/ Code/software for data retrieval from GitHub
ββ data/ The data we analyse
β ββ raw/ NEVER MAKE THIS PUBLIC! DATA PRIVACY!
ββ output/
β ββ figures/ Figures for the presentation
The Citation File Format enables you to provide the citation metadata for your software in easy-to-read YAML files. There are now several thousand CITATION.cff
files on GitHub alone.
This project looks at how these files are actually used:
A dataset with some thousand links to repositories containing CITATION.cff
files are in data/cff_repositories.csv
.
Some of the analyses are run in R, some in Python.
This is a collaborative project and we welcome suggestions and contributions. We hope one of the invitations below works for you, but if not, please let us know!
π I'm busy, I only have 1 minute
β³ I've got 5 minutes - tell me what I should do
π» I've got a few hours to work on this
π I really want to help increase the community
Please open a GitHub issue to suggest a new idea or let us know about bugs.
Software code and notebooks from this project are licensed under the open source Apache License, v2.0 license. Project documentation and images are licensed under CC BY 4.0. Data produced by this project in the data/outputs directory is licensed under CC0. Other data included in this project from other sources remains licensed under its original license.
See the CITATION.cff
file :tada:.