geonetwork / core-geonetwork

GeoNetwork is a catalog application to manage spatially referenced resources. It provides powerful metadata editing and search functions as well as an interactive web map viewer. It is currently used in numerous Spatial Data Infrastructure initiatives across the world.
http://geonetwork-opensource.org/
GNU General Public License v2.0
420 stars 489 forks source link

Harvest ESRI Open Data via DCAT #4541

Open jamessewell opened 4 years ago

jamessewell commented 4 years ago

I've done some searching - but I don't think this is directly possible at the moment.

ESRI open data provides a data.json file in the DCAT format. Other solutions like CKAN are capable of using this for harvest.

An example: https://vicroadsopendata-vicroadsmaps.opendata.arcgis.com/data.json

Cheers, James

fxprunayre commented 4 years ago

An option would be to use something like https://github.com/geonetwork/core-geonetwork/pull/4034 but needs some more work.

stijngoedertier commented 4 years ago

See also the harvester for the DCAT-AP schema plugin https://github.com/metadata101/dcat-ap1.1

Because it uses Apache Jena to read the RDF, it can deal with several RDF syntaxes: json-lod, rdf/xml, ttl, etc.

Currently, the harvester inserts metadata as DCAT-AP RDF-XML, but you can also make it work for other schemas like ISO19139. Basically, you will need to write an XSLT similar to this one rdf-to-xml.xsl and a Java Harvester similar to this one.