NIAID-Data-Ecosystem / nde-crawlers

Harvesting infrastructure to collect and standardize dataset and computational tool metadata
Apache License 2.0

[Normalization] Fix and link the `funding.identifier` entries using NIH RePORTER API #91

Open flaneuse opened 1 year ago

flaneuse commented 1 year ago

People fill out funding.identifier in all sorts of weird wacky formats. Clean up the:

using the NIH RePORTER API
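As a rough illustration of the cleanup described above, here is a minimal sketch that reduces free-form grant strings to a core NIH project number and builds a search payload for RePORTER. Everything named here is an assumption, not code from this repo: the regex is a loose guess at the project number layout (activity code, two-letter institute code, six-digit serial), the helper names are hypothetical, and the endpoint, the `criteria.project_nums` key, and the wildcard form should be checked against the RePORTER API docs before use.

```python
import json
import re
import urllib.request

# Assumed endpoint for the RePORTER v2 project search; verify against the API docs.
REPORTER_URL = "https://api.reporter.nih.gov/v2/projects/search"

# Loose pattern for strings such as "5R01AI123456-03" or "r01-ai-123456".
# Assumed layout: optional application type digit, 3-character activity code,
# 2-letter institute code, 6-digit serial number, optional 2-digit support year.
NIH_GRANT_RE = re.compile(
    r"(?P<type>\d)?(?P<activity>[A-Z][A-Z0-9]{2})[\s-]*"
    r"(?P<ic>[A-Z]{2})[\s-]*(?P<serial>\d{6})"
    r"(?:\s*-\s*(?P<year>\d{2}))?"
)


def core_project_number(raw):
    """Return a core project number like 'R01AI123456', or None if none is found."""
    match = NIH_GRANT_RE.search(raw.upper())
    if match is None:
        return None
    return f"{match['activity']}{match['ic']}{match['serial']}"


def build_reporter_query(raw_ids, limit=50):
    """Build a search payload from messy identifiers, dropping unparseable ones."""
    cores = sorted({c for c in map(core_project_number, raw_ids) if c})
    return {
        # Wildcards on both sides are an assumption about what the API accepts.
        "criteria": {"project_nums": [f"*{c}*" for c in cores]},
        "offset": 0,
        "limit": limit,
    }


def search_reporter(payload):
    """POST the payload to RePORTER and return the parsed JSON response."""
    request = urllib.request.Request(
        REPORTER_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)
```

Identifiers that do not parse (for example `n/a`) are dropped rather than guessed at, so they can be logged and reviewed by hand.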

gtsueng commented 8 months ago

Note that there are a number of records with funding info that consists only of funding.funder.name. These records will not be normalized via the funding.identifier.

Normalize all funding.funder entries to the Crossref Funder Registry using funding.funder.identifier, so that a search for one funding.funder.name pulls all records funded by the same funder (regardless of how the funding.funder.name was originally entered).
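One way this could look, as a sketch only: match each funding.funder.name against a small alias table and attach the registry ID as funding.funder.identifier, leaving the original name untouched. The alias table, the helper names, and the choice to match on lowercased alphanumerics are all assumptions for illustration; the two funder IDs shown should be verified against the Crossref Funder Registry, and a real table would likely be built from the registry rather than hard-coded.

```python
import re

# Hypothetical alias table keyed by Crossref funder ID (verify IDs before use).
CROSSREF_FUNDERS = {
    "10.13039/100000002": ["National Institutes of Health", "NIH"],
    "10.13039/100000060": [
        "National Institute of Allergy and Infectious Diseases",
        "NIAID",
    ],
}


def _key(name):
    """Collapse a funder name to lowercase alphanumerics for loose matching."""
    return re.sub(r"[^a-z0-9]+", "", name.lower())


# Reverse index: normalized alias -> funder ID.
ALIAS_INDEX = {
    _key(alias): funder_id
    for funder_id, aliases in CROSSREF_FUNDERS.items()
    for alias in aliases
}


def normalize_funder(funding):
    """Return a copy of a funding record with funder.identifier set when matched."""
    funder = funding.get("funder") or {}
    funder_id = ALIAS_INDEX.get(_key(funder.get("name") or ""))
    if funder_id is None:
        return funding  # unknown funder: leave the record unchanged
    return {**funding, "funder": {**funder, "identifier": funder_id}}
```

With this, records entered as `N.I.H.` and `National Institutes of Health` end up sharing one funding.funder.identifier, so a search can group on the identifier instead of the name.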