Open Ly0n opened 6 months ago
I've fixed the API to include two citation fields total_citations
and citation_counts
, there are 487 projects with citations, totalling 177909 citations.
I only see 28 projects with citations so far:
Ok my mistake. All the data is in the JSON.
I took some time today and tried to analyze the quality of the citations on a random sample basis. The citations of the first DOI found on the README gives halfway decen results. However I can only find 336 repositories with more than 0 citations. For a ranking we would have to filter a lot of projects out that just mention a DOI in the README from another paper. Here my results in the Colab: https://colab.research.google.com/github/protontypes/osta/blob/main/repository_insights.ipynb One table is interactive. Just set the entries per page to 100
The citation counts in the ecosystem repository API are all null. Therefor as a workaround for our study, I implemented 3 different APIs in our notebook for mapping DOI to citation counts. It might be interesting for ecosyste.ms to implement several of these APIs.
https://github.com/danielnsilva/semanticscholar https://github.com/J535D165/pyalex https://github.com/sckott/habanero
All Zenodo created DOIs will give me a zero citations. It might be interesting to get the zenodo metadata by the native API with such a tool: https://github.com/dvolgyes/zenodo_get