The-Academic-Observatory / academic-observatory-workflows

Telescopes, Workflows and Data Services for the Academic Observatory
https://academic-observatory-workflows.readthedocs.io
Apache License 2.0
16 stars 1 forks source link
academic data higher-education research-evaluation science workflow

Academic Observatory Workflows

Academic Observatory Workflows provides Apache Airflow workflows for fetching, processing and analysing data about academic institutions.

License Python Version Python package Documentation Status codecov DOI

Telescope Workflows

A telescope a type of workflow used to ingest data from different data sources, and to run workflows that process and output data to other places. Workflows are built on top of Apache Airflow's DAGs.

The workflows include: Crossref Events, Crossref Fundref, Crossref Metadata, Geonames, OpenAlex, Open Citations, ORCID, PubMed, ROR, Scopus, Unpaywall and Web of Science.

Telescope Workflow Description
Crossref Funder Registry The Crossref Funder Registry is an open registry of grant-giving organization names and identifiers, which can be used to find funder IDs and include them as part of metadata deposits. It is a freely-downloadable RDF file. It is CC0-licensed and available to integrate with your own systems. Funder names from acknowledgements should be matched with the corresponding unique funder ID from the Funder Registry.
Crossref Metadata Crossref is a non-for-profit membership organisation working on making scholarly communications better. It is an official Digital Object Identifier (DOI) Registration Agency of the International DOI Foundation. They provide metadata for every DOI that is registered with Crossref.
OpenAlex OpenAlex is a free and open catalog of the global research system.
ORCID ORCID is a non-profit organization that provides researchers with a unique digital identifier which eliminates the risk of confusing an identity with another researcher having the same name. ORCID provides a record that supports automatic links among all the researcher's professional activities.
PubMed PubMed is a free resource supporting the search and retrieval of biomedical and life sciences literature with the aim of improving health–both globally and personally.
ROR ROR is a global, community-led registry of open persistent identifiers for research organizations.
Scopus SCOPUS is an Elsevier bibliometrics database containing abstracts, citations, of journals, books, and conference proceedings.
Unpaywall Unpaywall is an open database of free scholarly articles. It includes data from open indexes like Crossref and DOAJ where it exists. Data comes from “monitoring over 50,000 unique online content hosting locations, including Gold OA journals, Hybrid journals, institutional repositories, and disciplinary repositories.

Documentation

For detailed documentation about the Academic Observatory see the Read the Docs website https://academic-observatory-workflows.readthedocs.io