ror-community / ror-roadmap

Central information about what is happening at ROR and how to contribute feedback

[FEATURE] Add Unique Entity ID (UEI) tags as external IDs #203

Open Marshlight opened 8 months ago

Marshlight commented 8 months ago

Describe the problem you would like to solve
Domestic and international organizations that receive funding from the US federal government through SAM.gov must have a UEI (https://sam.gov/content/duns-uei). It would be helpful to link UEI to ROR, as some USG funders are moving toward using UEI to disambiguate awardee institutions, and interoperability with ROR would (hopefully) make ROR easier to adopt for other data calls.

Describe the solution you'd like
Add UEI to ROR external IDs.

Who would benefit from this feature?
Government funders trying to track the output of organizations with a UEI; those with a UEI looking for their ROR ID (this is admittedly niche).

Additional information
I am using the Edugain ID ticket (https://github.com/ror-community/ror-roadmap/issues/146) as a reference for this one. I have done some UEI to GRID ID work already, and I'm sure there are some follow-up questions I'll need to answer.

amandafrench commented 7 months ago

@Marshlight Thanks so much for submitting this! Do you think it's possible to publish at least a UEI to ROR mapping in spreadsheet form? If so, we'd be happy to include that in our user documentation ASAP.

amandafrench commented 7 months ago

Or, I should add, the UEI to GRID mapping -- that would be easy to add ROR to, since the ROR dataset is already natively mapped to GRID.
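A minimal sketch of that join, assuming the UEI to GRID mapping is available as simple (UEI, GRID ID) pairs and that a GRID to ROR lookup has already been built from the ROR dataset. The function name, the field names, and every ID below are hypothetical placeholders, not real records or an existing ROR tool.

```python
# Hypothetical sample data; none of these identifiers are real.
uei_grid_rows = [
    ("UEI000000001", "grid.000001.a"),
    ("UEI000000002", "grid.000002.b"),
]

# Assumed GRID -> ROR lookup, e.g. prepared beforehand from the ROR dataset.
grid_to_ror = {"grid.000001.a": "https://ror.org/example01"}


def add_ror_ids(rows, lookup):
    """Attach a ROR ID to each (uei, grid_id) pair; unmatched rows get None."""
    return [
        {"uei": uei, "grid_id": grid_id, "ror_id": lookup.get(grid_id)}
        for uei, grid_id in rows
    ]


mapped = add_ror_ids(uei_grid_rows, grid_to_ror)
for row in mapped:
    print(row)
```

Rows whose GRID ID has no ROR match keep `ror_id` as `None`, so they can be flagged for manual review rather than silently dropped.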

Marshlight commented 7 months ago

@amandafrench we are working on cleaning up a UEI to GRID map for you, but are running into some data QA problems - multiple UEIs per GRID. This is probably one of those things that will need continued curation and I can't guarantee it's going to be complete or reliable...but we can send what we have, soon!

amandafrench commented 7 months ago

@Marshlight Ah, interesting. But yes, I think many others would be interested in even a rough version of this mapping. Thanks for working on it!

Marshlight commented 7 months ago

crosslinked-institution-identifiers.csv

Ok, here is a rough draft with institution names, GRID ID, UEI (from SAM.gov), CAGE (a DOD-specific ID), and DUNS (old and not complete, but it's there anyway). You'll notice several duplicate lines, most notably for West Virginia University, which has 10 rows for some reason. I have not tried to simplify this in any way yet, but I know it needs to be done. I probably won't get to it in the short term, but I'm happy to discuss QA further.
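As a starting point for that QA, here is a rough sketch of one way to drop exact duplicate rows and list the GRID IDs that map to more than one UEI. The column names (`name`, `grid_id`, `uei`, `cage`, `duns`) are assumptions about the CSV header, and the sample rows are made-up placeholders rather than data from the attached file.

```python
import csv
import io
from collections import defaultdict

# Hypothetical sample in the assumed column layout; no real identifiers.
SAMPLE_CSV = """name,grid_id,uei,cage,duns
Example University,grid.000001.a,UEI000000001,CAGE1,000000001
Example University,grid.000001.a,UEI000000001,CAGE1,000000001
Example University,grid.000001.a,UEI000000002,CAGE2,000000001
Sample Institute,grid.000002.b,UEI000000003,CAGE3,000000002
"""


def summarize_mapping(csv_text):
    """Return (rows without exact duplicates, GRID IDs with several UEIs)."""
    seen = set()
    unique_rows = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        key = tuple(row.items())
        if key not in seen:  # keep only the first copy of an identical row
            seen.add(key)
            unique_rows.append(row)

    ueis_by_grid = defaultdict(set)
    for row in unique_rows:
        if row["uei"]:  # skip rows where the UEI cell is empty
            ueis_by_grid[row["grid_id"]].add(row["uei"])

    multi_uei = {
        grid_id: sorted(ueis)
        for grid_id, ueis in ueis_by_grid.items()
        if len(ueis) > 1
    }
    return unique_rows, multi_uei


unique_rows, multi_uei = summarize_mapping(SAMPLE_CSV)
print(len(unique_rows), "unique rows")
print("GRID IDs with more than one UEI:", multi_uei)
```

This only removes rows that are identical in every column. Rows that differ in a single identifier, such as CAGE, are kept on purpose, since they may reflect a genuine one-to-many relationship rather than a data error.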

poworoznek commented 7 months ago

GRID/ROR to UEI will probably not end up being exactly 1:1 due to differences in the taxonomy but may get close(r). GRID to CAGE will always be 1:many for many institutions.

amandafrench commented 7 months ago

@Marshlight Sorry, I'm getting a "Not Found" message when I click on the link to the CSV. You can email it to support at ror dot org if you like.

Marshlight commented 7 months ago

crosslinked-institution-identifiers.csv

Does this work instead? Weird!

amandafrench commented 7 months ago

@Marshlight Yes, that worked! Thanks!