DataONEorg / mnlite

Light weight read-only DataONE member node in Python Flask
Apache License 2.0
0 stars 0 forks source link

Generate dummy sitemaps from Datacite queries #61

Open iannesbitt opened 7 months ago

iannesbitt commented 7 months ago

Datacite does not have a dataset-focused sitemap. In order to scrape schema.org metadata, mnlite would need to generate a dummy sitemap from Datacite API queries. Parameters would need to include (probably):

JSON -> sitemap value mappings:

---
title: Value map for Datacite dummy sitemap creation
---
flowchart LR
    id["JSON:id"]
    api["sitemap entry with url tags enclosing https://api.datacite.org/dois/{id}"]
    updated["JSON:attributes:updated"]
    lastmod["lastmod tags enclosing {updated}"]
    url["JSON:attributes:url"]
    mnurl["sysmeta url of {url}"]
    id --> api
    updated --> lastmod
    url --> mnurl

When requesting individual records from the dummy sitemap, mnlite will use content negotiation to request JSON-LD 🎉

iannesbitt commented 7 months ago

Datacite API docs: https://support.datacite.org/docs/api

iannesbitt commented 1 month ago

This feature may also be needed for DataONEorg/member-repos#96.