soilwise-he / harvesters

MIT License
0 stars 0 forks source link

Import/harvest records from the ESDAC data repository #5

Closed pvgenuchten closed 2 weeks ago

pvgenuchten commented 3 months ago

As described in #4, the records of the current ESDAC repository should be imported at intervals. The ESDAC repository is a Drupal CMS instance with limited harvesting capabilities. Most optimal would be to receive a database dump of the drupal database at intervals or the xls export is extended with all metadata properties.

The above effort does not include maps,services,applications and documents, such as EUDASM

The ESDAC portal includes a section with soil knowledge on relevant environmental themes, from the knowledge themes, links are available to underlying data evidence from the data repository

image

ESDAC includes various other registries, such as:

To be clarified, the relation between EUSO/SWR and ESDAC, for which aspects will ESDAC act as a source for the resources in SWR

pvgenuchten commented 1 month ago

the esdac records are also harvested by the impact4soil.com tool by Cirad, idea is to setup a meeting with Cirad to understand which tooling/approach they use.

[Update] this was a manual effort

pvgenuchten commented 2 weeks ago

Resolved by 0ff7529305fc454a1b438955d5a8c1f7794f1fbd

It introduces an interesting approach to extract RDFa from each of the dataset pages

Only for the datasets listed in https://esdac.jrc.ec.europa.eu/dataset-list/dataset/28

Not sure how the other datasets can be found