DataONEorg / mnlite

Light weight read-only DataONE member node in Python Flask
Apache License 2.0

Harvest multiple jsonld documents from a single HTML page #63

Closed. iannesbitt closed this issue 7 months ago

iannesbitt commented 7 months ago

The Greenland Ecosystem Monitoring repository has a single harvest page (https://api.g-e-m.dk/api/dataset/harvest). We need to be able to scrape all of the embedded JSON-LD items from that page in a single pass. This is technically pretty inefficient: right now we define the sitemap ourselves, so there is no lastmod field to indicate which items have changed.

This will be achieved by setting "which_jsonld": "all" in the settings.json file.
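
As a rough illustration of what "all" could mean in practice, here is a minimal sketch that collects every `<script type="application/ld+json">` block from one HTML page using only the Python standard library. The names `JsonLdCollector` and `extract_jsonld` are hypothetical and are not taken from the mnlite code base. Treating any value other than "all" as "first block only" is likewise an assumption made for the example, not documented mnlite behaviour.

```python
import json
from html.parser import HTMLParser


class JsonLdCollector(HTMLParser):
    """Collect the raw text of every <script type="application/ld+json"> element."""

    def __init__(self):
        super().__init__()
        self._in_jsonld = False
        self._chunks = []
        self.blocks = []

    def handle_starttag(self, tag, attrs):
        script_type = (dict(attrs).get("type") or "").lower()
        if tag == "script" and script_type == "application/ld+json":
            self._in_jsonld = True
            self._chunks = []

    def handle_data(self, data):
        # Script content can arrive in several pieces, so buffer it.
        if self._in_jsonld:
            self._chunks.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self.blocks.append("".join(self._chunks))
            self._in_jsonld = False


def extract_jsonld(html, which_jsonld="all"):
    """Return the parsed JSON-LD documents embedded in an HTML string.

    which_jsonld="all" returns every block on the page. Any other value
    returns only the first block (an assumed fallback for this sketch).
    """
    collector = JsonLdCollector()
    collector.feed(html)
    collector.close()

    docs = []
    for raw in collector.blocks:
        try:
            docs.append(json.loads(raw))
        except json.JSONDecodeError:
            # Skip malformed blocks rather than failing the whole page.
            continue

    if which_jsonld == "all":
        return docs
    return docs[:1]
```

With a page fetched from the harvest URL, `extract_jsonld(page_text, which_jsonld="all")` would return a list with one parsed document per embedded block, which the harvester could then process item by item.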