ecosyste-ms / ost

A curated list of open technology projects to sustain a stable climate, energy supply, biodiversity and natural resources, based on data from https://opensustain.tech
https://ost.ecosyste.ms
GNU Affero General Public License v3.0
7 stars 1 forks source link

Comparison of DOI to Citation APIs #231

Open Ly0n opened 6 months ago

Ly0n commented 6 months ago

The citation counts in the ecosystem repository API are all null. Therefor as a workaround for our study, I implemented 3 different APIs in our notebook for mapping DOI to citation counts. It might be interesting for ecosyste.ms to implement several of these APIs.

https://github.com/danielnsilva/semanticscholar https://github.com/J535D165/pyalex https://github.com/sckott/habanero

All Zenodo created DOIs will give me a zero citations. It might be interesting to get the zenodo metadata by the native API with such a tool: https://github.com/dvolgyes/zenodo_get

andrew commented 6 months ago

I've fixed the API to include two citation fields total_citations and citation_counts, there are 487 projects with citations, totalling 177909 citations.

Ly0n commented 6 months ago

I only see 28 projects with citations so far: Screenshot from 2024-05-16 08-34-39

Ly0n commented 6 months ago

Ok my mistake. All the data is in the JSON.

Ly0n commented 6 months ago

I took some time today and tried to analyze the quality of the citations on a random sample basis. The citations of the first DOI found on the README gives halfway decen results. However I can only find 336 repositories with more than 0 citations. For a ranking we would have to filter a lot of projects out that just mention a DOI in the README from another paper. Here my results in the Colab: https://colab.research.google.com/github/protontypes/osta/blob/main/repository_insights.ipynb One table is interactive. Just set the entries per page to 100