earthcubearchitecture-project418 / services

RPC services package for Project 418

Blazegraph OutOfMemory errors on simple queries #3

Closed ashepherd closed 6 years ago

ashepherd commented 6 years ago

@fils Do you know if there's an easy way to ramp up the memory on Blazegraph?

ERROR: SPARQL-QUERY: queryStr=SELECT DISTINCT ?g ?class ?property WHERE { GRAPH ?g { ?s rdf:type ?class . ?s ?property [] } } ORDER BY ?g ?class ?property

java.lang.OutOfMemoryError: GC overhead limit exceeded

fils commented 6 years ago

@ashepherd I doubled the amount of memory the triplestore gets.

However, I get a different error with that query:

ERROR: SPARQL-QUERY: queryStr=
SELECT DISTINCT ?g ?class ?property WHERE { GRAPH ?g { ?s rdf:type ?class . ?s ?property [] } } ORDER BY ?g ?class ?property 
java.util.concurrent.ExecutionException: java.util.concurrent.ExecutionException: 
com.bigdata.rdf.sparql.ast.QuadsOperationInTriplesModeException: Use of WITH and GRAPH 
constructs in query body is not supported in triples mode.
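
The exception indicates that this Blazegraph namespace runs in triples mode, where `GRAPH` patterns are rejected. A minimal sketch of an equivalent query for triples mode, assuming the per-graph breakdown (`?g`) can be dropped; the `PREFIX` line is added here for portability and is not in the original query:

```sparql
# Assumption: graph names are not needed, so the GRAPH ?g wrapper is removed.
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>

SELECT DISTINCT ?class ?property
WHERE {
  ?s rdf:type ?class .
  ?s ?property []
}
ORDER BY ?class ?property
```

Adding a `LIMIT` would cap the number of rows returned, though `DISTINCT` and `ORDER BY` may still need to process every match first.
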
fils commented 6 years ago

@ashepherd I was able to do

SELECT ?p (COUNT(?p) AS ?count)
WHERE
{
  ?s ?p ?o
} 
GROUP BY ?p
ORDER BY DESC(?count)

and got the following in 150 ms, so hopefully the extra memory enables things now.


p | count
-- | --
rdf:type | 594543
schema:description | 534432
schema:url | 533346
schema:unitText | 491071
schema:value | 491071
schema:variableMeasured | 491071
schema:name | 46407
schema:publisher | 21081
schema:contentUrl | 20714
schema:distribution | 20714
schema:keywords | 18219
schema:geo | 18171
schema:latitude | 18171
schema:longitude | 18171
schema:spatialCoverage | 18171
schema:includedInDataCatalog | 12178
schema:identifier | 7340
schema:alternateName | 5247
schema:license | 2910
schema:version | 2753
schema:contentLocation | 2720
schema:temporalCoverage | 2653
schema:datePublished | 2543
schema:encodingFormat | 2543
....  truncated 
fils commented 6 years ago

Just to test it on types, I did:

SELECT ?type (COUNT(?type) AS ?count)
WHERE
{
  ?res a ?type
} 
GROUP BY ?type
ORDER BY DESC(?count)

and got this in 167 ms:

type | count
-- | --
schema:PropertyValue | 491071
schema:Dataset | 21186
schema:Place | 20891
schema:DataDownload | 20714
schema:Organization | 18464
schema:GeoCoordinates | 18171
schema:DataCatalog | 2910
schema:Person | 1125
schema:SearchAction | 4
schema:EntryPoint | 3
schema:ContactPoint | 1
schema:DigitalDocument | 1
schema:ProgramMembership | 1
schema:WebSite | 1
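
The two aggregate queries in this thread could plausibly be combined to recover the class/property overview that the original failing query was after. This is a sketch only and has not been run against this store; the `LIMIT` value is an arbitrary assumption:

```sparql
# Count how often each property occurs on resources of each type.
# Works in triples mode because it uses no GRAPH pattern.
SELECT ?type ?p (COUNT(*) AS ?count)
WHERE
{
  ?res a ?type .
  ?res ?p ?o
}
GROUP BY ?type ?p
ORDER BY DESC(?count)
LIMIT 100
```
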
fils commented 6 years ago

Resolved by using Virtuoso for assay