IQSS / dataverse-pm

Project management issue tracker for the Dataverse Project. Note: Related links and documents may not be public.
https://dataverse.org
0 stars 0 forks source link

Epic: Harvard Dataverse Repository NIH Metrics #217

Open cmbz opened 3 months ago

cmbz commented 3 months ago

Overview

Tracking issue for monthly reports of NIH-funded datasets in Harvard Dataverse.

Resources

How these metrics are gathered for the monthly reports

At the beginning of each month @jggautier runs a Python script that:

@jggautier then reviews any datasets that were included in previous months but removed, reviews the metadata of newly added datasets to make sure there's actually some indication of NIH funding, removes any datasets that aren't from NIH-funded research, and adjusts the script so that those datasets are ignored when the script is used again. The script is also adjusted to include datasets that @jggautier and colleagues know have been funded by the NIH and are missing such indications in their metadata.

Search details The Python script uses the Search API to look across four metadata fields - Funding Information Agency, Contributor Name, Description, and Notes - for the full name of the NIH and its acronym and the full names of all NIH centers and institutes and most of their acronyms.

When looking through metadata in the Description field and Notes field, the script also looks for variations of the words "fund", "sponsor", "award", and "support" to increase the chances that it finds only datasets with metadata that acknowledges NIH funding.

cmbz commented 3 months ago

Status: March 2024

cmbz commented 2 months ago

Status: April 2024

cmbz commented 2 months ago

Status: May 2024

cmbz commented 1 month ago

Status: June 2024