usap-dc-dev / usap-dc-website

Repository for the USAP-DC website. Includes a JavaScript client-side app and a Python/Flask server side.

fix and update science-on-schema.org json-ld info in dataset landing pages #51

Open fnitsche2001 opened 3 months ago

fnitsche2001 commented 3 months ago

There is an alternative, however, over which I do have control: we can switch the USAP-DC harvest to rely on the schema.org protocol rather than the older GMN software we are using currently. Since USAP-DC already has schema.org Dataset JSON-LD descriptions embedded in its dataset landing pages, we would only need to fix a few issues there to get this working again. I briefly outline what I think are the key issues below. A much more in-depth description of how to describe scientific datasets using the schema.org protocol is available via science-on-schema.org (SOSO). There are many ways to describe a dataset, but the indispensable resource for figuring out how to do this is the SOSO Dataset description guide: https://github.com/ESIPFed/science-on-schema.org/blob/master/guides/Dataset.md. In addition to reviving metadata flows to DataONE, these changes would likely have the added benefit of moving your dataset landing pages into the Google indexer's good graces.
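For readers unfamiliar with the approach, here is a minimal sketch of a schema.org Dataset description embedded in a landing page as JSON-LD. The function names (`build_dataset_jsonld`, `to_script_tag`), the choice of properties, and all example values are hypothetical and are not taken from the USAP-DC code. The SOSO guide linked above remains the authority on which properties a harvester expects.

```python
import json


def build_dataset_jsonld(name, description, url, identifier):
    """Return a minimal schema.org Dataset description as a dict.

    The property set here is an illustrative assumption, not the
    list required by any particular harvester.
    """
    return {
        "@context": "https://schema.org/",
        "@type": "Dataset",
        "name": name,
        "description": description,
        "url": url,
        "identifier": identifier,
    }


def to_script_tag(doc):
    """Serialize the description into a JSON-LD <script> element."""
    # ensure_ascii=False keeps non-ASCII characters as literal Unicode.
    payload = json.dumps(doc, indent=2, ensure_ascii=False)
    # Escape "</" so a string value cannot close the <script> element early;
    # "\/" is a valid JSON escape for "/".
    payload = payload.replace("</", "<\\/")
    return '<script type="application/ld+json">\n' + payload + "\n</script>"


if __name__ == "__main__":
    # Placeholder values for illustration only.
    doc = build_dataset_jsonld(
        name="Example dataset",
        description="Placeholder description of an example dataset.",
        url="https://example.org/dataset/000000",
        identifier="doi:10.0000/example",
    )
    print(to_script_tag(doc))
```

In a Flask application the resulting string would typically be placed in the page `<head>` by the template, though how USAP-DC renders its landing pages is not described in this issue.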

Here are the major required changes I will highlight:

  1. Replace unsupported Unicode HTML character codes in the Githubissues.
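One possible reading of the first item is that HTML character references (for example `&#8211;`) appear inside metadata strings and should be replaced with the literal Unicode characters before the JSON-LD is serialized. The issue text does not spell this out, so the sketch below rests on that assumption. The helper name `decode_character_references` is hypothetical.

```python
import html
import json


def decode_character_references(value):
    """Recursively replace HTML character references in strings.

    Assumption: the metadata is a nested structure of dicts, lists,
    and strings, as produced by json.loads.
    """
    if isinstance(value, str):
        # html.unescape converts named and numeric references,
        # e.g. "&amp;" to "&" and "&#8211;" to an en dash.
        return html.unescape(value)
    if isinstance(value, list):
        return [decode_character_references(v) for v in value]
    if isinstance(value, dict):
        return {k: decode_character_references(v) for k, v in value.items()}
    # Numbers, booleans, and None pass through unchanged.
    return value


if __name__ == "__main__":
    raw = {"name": "Example &#8211; dataset", "keywords": ["ice &amp; snow"]}
    cleaned = decode_character_references(raw)
    # ensure_ascii=False writes the decoded characters as literal Unicode.
    print(json.dumps(cleaned, ensure_ascii=False, indent=2))
```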