andreaspacher / openeditors

Webscraping data about editors of scientific journals.
https://openeditors.ooir.org/
Creative Commons Zero v1.0 Universal
54 stars 11 forks source link
academia bibliometrics editors journals meta-science publishers research science scientometrics

Open Editors

Purpose

Webscraping data about scholarly journals' editors and editorial boards.

For methodical details and more information, see this preprint at SocArXiv.

Results

Basic Data

Data about ca. 594.580 editorial positions at 7.352 journals from 26 publishers were found.

To see the results, see the Output-folder, or browse through the dataset at https://openeditors.ooir.org/.

In the Output/2022-Scraping-folder, the data were split into two files to avoid documents of over 100 MB (editors1.csv and editors2.csv).

A parquet-file (editors.parquet) offers a single document.

Note that the Output/2021-Scraping-folder include files with *_ror.csv. They contain ROR-identifiers, while the ones with *_ror_countries.csv have yet another column with automatically extracted country names. (I plan to extend those data to the files in Output/2022-Scraping in the near future.)

Data Sample

The current version contains data from journals pertaining to the following 21 (non-predatory) publishers:

Predatory Publishers

The five predatory publishers in the sample are:

Journal URLs

The URLs to the journal websites (as visible in the Data-folder) stems from https://github.com/andreaspacher/academic-publishers.

Related Projects

Jan Kamlah and Philipp Zumstein have scraped editors from Springer, Wiley & Elsevier with Python at https://github.com/JKamlah/scrape-editorial-board.