sdruskat / cff-in-the-wild

Analysis of Citation File Format files on GitHub
Apache License 2.0
2 stars 0 forks source link

cff-in-the-wild

Analysis of Citation File Format files on GitHub.

This is an SSI Collaborations Workshop 2022 hack day project 😍.

Repository structure πŸ—‚οΈ

. (root)/                    You are here.
β”œβ”€ code/                     
β”‚  β”œβ”€ data-retrieval/        Code/software for data retrieval from GitHub
β”œβ”€ data/                     The data we analyse
β”‚  β”œβ”€ raw/                   NEVER MAKE THIS PUBLIC! DATA PRIVACY!
β”œβ”€ output/
β”‚  β”œβ”€ figures/               Figures for the presentation

Introduction

The Citation File Format enables you to provide the citation metadata for your software in easy-to-read YAML files. There are now several thousand CITATION.cff files on GitHub alone.

This project looks at how these files are actually used:

Usage

A dataset with some thousand links to repositories containing CITATION.cff files are in data/cff_repositories.csv.

Some of the analyses are run in R, some in Python.

Contributing ✏️

This is a collaborative project and we welcome suggestions and contributions. We hope one of the invitations below works for you, but if not, please let us know!

πŸƒ I'm busy, I only have 1 minute

⏳ I've got 5 minutes - tell me what I should do

πŸ’» I've got a few hours to work on this

πŸŽ‰ I really want to help increase the community

Please open a GitHub issue to suggest a new idea or let us know about bugs.

Licenses βš–οΈ

Software code and notebooks from this project are licensed under the open source Apache License, v2.0 license. Project documentation and images are licensed under CC BY 4.0. Data produced by this project in the data/outputs directory is licensed under CC0. Other data included in this project from other sources remains licensed under its original license.

Contributors πŸ’–

See the CITATION.cff file :tada:.