Various DCAT tools for harvesting metadata from Belgian open data portals, converting metadata to DCAT-AP files and updating the Belgian data.gov.be portal.
The portal itself is a Drupal 9 website, based on Fedict's Openfed distribution.
Only interested in the result? The N-Triples and XML files (DCAT-AP) used to update data.gov.be can be found in the dcat repository.
These tools require a Java runtime, version 17 or newer, and run on a headless machine, i.e. there is no GUI.
An Internet connection is required, although a proxy can be used.
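As a quick sanity check of the requirements above, the following minimal sketch reports whether the current runtime is Java 17 or newer and whether the standard JVM proxy properties are set. The class name `RuntimeCheckSketch` is hypothetical and not part of these tools. `https.proxyHost` and `https.proxyPort` are the standard Java networking properties; whether the tools themselves read them, or use their own proxy setting, is an assumption to verify against each tool's configuration.

```java
// Hypothetical helper, not part of the DCAT tools.
class RuntimeCheckSketch {
    /** Returns true when the given Java feature version meets the stated minimum of 17. */
    static boolean isSupported(int featureVersion) {
        return featureVersion >= 17;
    }

    public static void main(String[] args) {
        // Runtime.version().feature() is the major version, e.g. 17 or 21
        int feature = Runtime.version().feature();
        System.out.println("Java feature version: " + feature
                + (isSupported(feature) ? " (ok)" : " (too old, 17 or newer is required)"));

        // Standard JVM networking properties; assumption: the tools may use a different mechanism
        String proxyHost = System.getProperty("https.proxyHost");
        String proxyPort = System.getProperty("https.proxyPort");
        if (proxyHost == null) {
            System.out.println("No JVM-level HTTPS proxy configured");
        } else {
            System.out.println("JVM-level HTTPS proxy: " + proxyHost + ":" + proxyPort);
        }
    }
}
```

The proxy properties can be passed on the command line, for example with `-Dhttps.proxyHost=...` and `-Dhttps.proxyPort=...`.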
There is also a separate, stand-alone RDF validator project, which can be used to validate DCAT metadata regardless of whether the metadata is to be published on data.gov.be.
all
) should be harvested using the scrapers.
all enhancer to merge all the files from the various portals into one file datagovbe.nt
datagovbe_edp.xml
datagovbe.nt and datagovbe_edp.xml to GitHub
See also the Notes
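To illustrate the merge step in the workflow above, here is a minimal sketch of combining several N-Triples files into one. It is not the code of the enhancer itself: the class name `NTriplesMergeSketch` is hypothetical, and the approach (collecting unique lines) is a simplification that works only because N-Triples stores one triple per line. It does not rename blank nodes, so blank node labels that happen to be reused across files would be merged incorrectly, and it does not detect triples that are identical but written differently.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

// Hypothetical helper, not part of the DCAT tools.
class NTriplesMergeSketch {
    /** Returns the unique triple lines of all inputs, in first-seen order. */
    static Set<String> merge(List<Path> inputs) throws IOException {
        Set<String> triples = new LinkedHashSet<>();
        for (Path input : inputs) {
            for (String line : Files.readAllLines(input, StandardCharsets.UTF_8)) {
                String trimmed = line.strip();
                // Skip empty lines and N-Triples comments
                if (!trimmed.isEmpty() && !trimmed.startsWith("#")) {
                    triples.add(trimmed);
                }
            }
        }
        return triples;
    }

    public static void main(String[] args) throws IOException {
        if (args.length < 2) {
            System.err.println("usage: <output.nt> <input.nt>...");
            return;
        }
        List<Path> inputs = new ArrayList<>();
        for (int i = 1; i < args.length; i++) {
            inputs.add(Path.of(args[i]));
        }
        // Write one triple per line to the output file
        Files.write(Path.of(args[0]), merge(inputs), StandardCharsets.UTF_8);
    }
}
```

With a hypothetical set of per-portal files, this could be run as `java NTriplesMergeSketch.java datagovbe.nt portal1.nt portal2.nt`. A full RDF library would be the safer choice for production use, since it parses the triples and handles blank nodes properly.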