skybristol / geokb

Data processing workflows for initializing and building the Geoscience Knowledgebase
The Unlicense

process for linking people to expertise #14

Open skybristol opened 10 months ago

skybristol commented 10 months ago

Capacity assessment use cases need us to integrate the concepts that people have asserted as their expertise. These come from a combination of USGS staff profile information (scraped from web pages and cached in person item talk pages) and keywords in ORCID records. We have a start on some of these concepts via selective integration with the USGS Thesaurus (scientific methods and "science topics"). Both sources of expertise terms, the profiles and the ORCID keywords, are uncontrolled and do not include identifiers, so we are left with simple concept mapping logic and some amount of uncertainty.

We'll run an initial process that maps as many terms as we can from the source data to the concepts we have started laying out in the GeoKB, and tees up the remaining concepts for further evaluation and integration into the graph. This will build on a notebook already started for processing cached profile data.
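
A minimal sketch of what that first mapping pass could look like, assuming exact matching on normalized labels. This is not the notebook's code: the function names, the concept identifiers (`Q-geochem`, `Q-remote`), the person keys, and the sample terms are all made up for illustration. Terms that match a known label get linked to the concept; everything else is counted so the most common unmatched terms can be reviewed first.

```python
import re
from collections import Counter


def normalize(term: str) -> str:
    """Casefold and collapse punctuation/whitespace so near-identical labels compare equal."""
    term = re.sub(r"[^\w\s]", " ", term.casefold())
    return re.sub(r"\s+", " ", term).strip()


def build_label_index(concepts: dict[str, list[str]]) -> dict[str, str]:
    """Map each normalized label (or alias) to a concept identifier.

    If two concepts share a label, the first one seen wins; a real process
    would want to flag that collision instead.
    """
    index: dict[str, str] = {}
    for concept_id, labels in concepts.items():
        for label in labels:
            index.setdefault(normalize(label), concept_id)
    return index


def map_terms(person_terms: dict[str, list[str]], index: dict[str, str]):
    """Return (person -> matched concept ids, Counter of unmatched normalized terms)."""
    matched: dict[str, set[str]] = {}
    unmatched: Counter = Counter()
    for person, terms in person_terms.items():
        for term in terms:
            key = normalize(term)
            if not key:
                continue  # skip empty or punctuation-only strings
            concept_id = index.get(key)
            if concept_id is not None:
                matched.setdefault(person, set()).add(concept_id)
            else:
                unmatched[key] += 1
    return matched, unmatched


# Hypothetical concepts already in the graph (identifiers are placeholders).
concepts = {
    "Q-geochem": ["geochemistry", "Geochemical analysis"],
    "Q-remote": ["remote sensing"],
}

# Hypothetical expertise terms pulled from profiles and ORCID keywords.
person_terms = {
    "person-a": ["Geochemistry", "Remote-Sensing", "machine learning"],
    "person-b": ["machine learning", "  remote sensing "],
}

index = build_label_index(concepts)
matched, unmatched = map_terms(person_terms, index)

for person, ids in matched.items():
    print(person, sorted(ids))
print(unmatched.most_common())
```

Exact matching after normalization keeps false links low at the cost of missing variants (plurals, abbreviations, misspellings), which fits the "some amount of uncertainty" caveat: anything ambiguous lands in the unmatched queue rather than in the graph.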