soilwise-he / triplestore-virtuoso

a triple store to store the soilwise knowledge graph
Creative Commons Zero v1.0 Universal
0 stars 0 forks source link

deploy virtuoso and load some data #3

Open pvgenuchten opened 5 months ago

pvgenuchten commented 5 months ago

a getting started on loading some dcat data

as described in EJPsoil wiki you can spin a local instance of virtuoso using this docker compose.

you can upload the zip attached (as extracted from https://nationaalgeoregister.nl) using quad-store-upload (in the linked data menu of virtuoso conductor), set the iri to some iri (you need it when querying the graph)

image

Run a sparql query from the sparql panel:

summary of graph

select distinct ?class (count(?thing) as ?numInstances) where {
  graph <http://soilwise-he.eu> {
  ?thing a ?class .
  }
}

datasets per organisation

PREFIX dcat: <http://www.w3.org/ns/dcat#>
PREFIX dcterms: <http://purl.org/dc/terms/>

SELECT ?organization (count (distinct ?dataset) as ?numDatasets)
(count (distinct ?distribution) as ?numDistributions)
WHERE {
  graph <http://soilwise-he.eu> {
 ?dataset a dcat:Dataset ;
          dcterms:publisher ?organization ;
                dcat:distribution ?distribution .  }
} 
group by ?organization
order by desc(?numDatasets)
limit 20

count by license

PREFIX dcat: <http://www.w3.org/ns/dcat#>
PREFIX dct: <http://purl.org/dc/terms/>

SELECT ?url (count(distinct ?dataset) as ?count) WHERE {
SELECT (IRI(?name) AS ?url) ?dataset WHERE {
            ?dataset a dcat:Dataset .
            ?dataset dct:license ?name
            FILTER(isLiteral(?name))
        }
} GROUP BY ?url

count by format

PREFIX dcat: <http://www.w3.org/ns/dcat#>
PREFIX dct: <http://purl.org/dc/terms/>

SELECT ?format (count(distinct ?dataset) as ?count) WHERE {
SELECT ?dataset ?format WHERE {
  ?dataset dcat:distribution ?distribution .
  ?distribution dct:format ?format .
}
} GROUP BY ?format

Exported records from NGR https://www.nationaalgeoregister.nl/geonetwork/srv/api/rdf.search?from=100. Some observations:

Download: rdf200.zip

hugodegrootwurnl commented 5 months ago

Added glosis codelists: