subugoe / hoaddata

Datasets about hybrid open access publishing
https://subugoe.github.io/hoaddata/
Creative Commons Zero v1.0 Universal

hoaddata


This package contains information about the open access uptake of hybrid journals.

The main purpose of hoaddata is to provide data for the Hybrid Open Access Dashboard (HOAD), being developed at the SUB Göttingen with the support of the Deutsche Forschungsgemeinschaft. By making the data available as an R package, hoaddata can also be used for other data analytics tasks with R.

The data cover the publication period 2017 to 2023.

Installation

You can install hoaddata from GitHub with:

# install.packages("remotes")
remotes::install_github("subugoe/hoaddata", dependencies = "Imports")
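Once the package is installed, a quick way to see what it ships is to list its datasets. The following is a minimal sketch using base R only; the helper name `list_datasets` is hypothetical and not part of hoaddata, and the dataset names it returns depend on the installed version.

```r
# Hypothetical helper (not part of hoaddata): list the datasets a package ships.
list_datasets <- function(pkg = "hoaddata") {
  # utils::data(package = ...) returns an object whose `results` element is a
  # character matrix with the columns Package, LibPath, Item and Title.
  found <- utils::data(package = pkg)
  found$results[, c("Item", "Title"), drop = FALSE]
}

# Only query hoaddata if it is actually installed.
if (requireNamespace("hoaddata", quietly = TRUE)) {
  print(list_datasets("hoaddata"))
}
```

Each listed dataset should also have a help page, which can be opened with `?` followed by the dataset name.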

Data sources

The package combines open data from multiple sources, including Crossref, OpenAlex, the Journal Checker Tool, and the Open Access Monitor journal lists.

Data gathering

The data-raw folder contains the R code used to generate the hoaddata datasets.

Most of the data was obtained by connecting to the subugoe-collaborative scholarly data warehouse, a collection of big scholarly datasets hosted on Google BigQuery and maintained by the SUB Göttingen. Crossref was used to determine the publication volume and the articles made available under a CC license, while affiliation data was gathered from OpenAlex.

You can find the corresponding SQL code in the inst/sql/ folder.
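Files under inst/ are copied to the top level of an installed R package, so the SQL code can also be located from an R session. The sketch below assumes the queries use a `.sql` file extension; the helper name `list_sql_files` is hypothetical and not part of hoaddata.

```r
# Hypothetical helper (not part of hoaddata): locate SQL files in an installed package.
list_sql_files <- function(pkg = "hoaddata") {
  # inst/sql/ in the source tree becomes sql/ in the installed package.
  # system.file() returns "" when the directory or the package is not found.
  sql_dir <- system.file("sql", package = pkg)
  if (!nzchar(sql_dir)) {
    return(character(0))
  }
  # Assumption: query files end in .sql
  list.files(sql_dir, pattern = "\\.sql$", full.names = TRUE, recursive = TRUE)
}

sql_files <- list_sql_files("hoaddata")

# Print the first query, if any was found.
if (length(sql_files) > 0) {
  cat(readLines(sql_files[1]), sep = "\n")
}
```

Returning an empty character vector when nothing is found keeps the helper safe to call on machines where hoaddata is not installed.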

The data package is built automatically using GitHub Actions. Every merge into the main branch triggers a data update by running the scripts in the data-raw/ folder. Data changes are merged into the package and tracked with Git. This makes it easy to update and reproduce different versions of the data contained in hoaddata.

Data re-use and licenses

Datasets are released into the public domain.

Anyone is free to copy, modify, publish, use, compile, sell, or distribute these materials in any form, for any purpose, commercial or non-commercial, and by any means.

Crossref asserts no claims of ownership to individual items of bibliographic metadata and associated Digital Object Identifiers (DOIs) acquired through the use of the Crossref Free Services. Individual items of bibliographic metadata and associated DOIs may be cached and incorporated into the user's content and systems.

OpenAlex and Journal Checker Tool data are made available under the CC0 license.

Transformative Agreements Public Data is made available under the CC0 license.

This work re-used the following dataset, published under CC BY 4.0:

Pollack, Philipp; Lindstrot, Barbara; Barbers, Irene; Stanzel, Franziska, 2021, "Open Access Monitor: Zeitschriftenlisten (V2)", https://doi.org/10.26165/JUELICH-DATA/VTQXLM.