Closed alexskr closed 10 months ago
OK, that's very interesting. The same issue we have been dealing with in production, right? Looks like we'll have to dig into this, but maybe using AG tooling it will be easier.
Example of ontologies that have problems with metrics: AGRO
ARGO metrics completes. However, when running metrics for larger ontologies like NCIT we see this behavior where two out of ten AllegroGraph backends are using 100% CPU and stay in this state for hours even after metrics process ERRORs out.
AllegroGraph Admin Interface shows the following under "Jobs":
Jobs
ontoportal (backend) - stop SELECT ?x0 ?x1 ?x2 ?x3 ?x4 ?x5 ?x6 ?x7 ?x8 ?x9 ?x10 ?x11 ?x12 ?x13 WHERE {
GRAPH <http://data.bioontology.org/ontologies/NCIT/submissions/1> {
?x0 <http://www.w3.org/2000/01/rdf-schema#subClassOf> <http://ncicb.nci.nih.gov/xml/owl/EVS/Thesaurus.owl#C7057>.
?x1 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x0.
?x2 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x1.
?x3 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x2.
?x4 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x3.
?x5 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x4.
?x6 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x5.
?x7 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x6.
?x8 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x7.
?x9 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x8.
?x10 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x9.
?x11 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x10.
?x12 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x11.
?x13 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x12
} } LIMIT 1
ontoportal (backend) - stop SELECT ?x0 ?x1 ?x2 ?x3 ?x4 ?x5 ?x6 ?x7 ?x8 ?x9 ?x10 ?x11 ?x12 WHERE {
GRAPH <http://data.bioontology.org/ontologies/NCIT/submissions/1> {
?x0 <http://www.w3.org/2000/01/rdf-schema#subClassOf> <http://ncicb.nci.nih.gov/xml/owl/EVS/Thesaurus.owl#C7057>.
?x1 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x0.
?x2 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x1.
?x3 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x2.
?x4 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x3.
?x5 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x4.
?x6 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x5.
?x7 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x6.
?x8 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x7.
?x9 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x8.
?x10 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x9.
?x11 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x10.
?x12 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x11
} } LIMIT 1
ontoportal (backend) - stop SELECT ?x0 ?x1 ?x2 ?x3 ?x4 ?x5 ?x6 ?x7 ?x8 ?x9 ?x10 ?x11 WHERE {
GRAPH <http://data.bioontology.org/ontologies/NCIT/submissions/1> {
?x0 <http://www.w3.org/2000/01/rdf-schema#subClassOf> <http://ncicb.nci.nih.gov/xml/owl/EVS/Thesaurus.owl#C7057>.
?x1 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x0.
?x2 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x1.
?x3 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x2.
?x4 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x3.
?x5 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x4.
?x6 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x5.
?x7 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x6.
?x8 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x7.
?x9 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x8.
?x10 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x9.
?x11 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x10
} } LIMIT 1
ontoportal (backend) - stop SELECT ?x0 ?x1 ?x2 ?x3 ?x4 ?x5 ?x6 ?x7 ?x8 ?x9 ?x10 WHERE {
GRAPH <http://data.bioontology.org/ontologies/NCIT/submissions/1> {
?x0 <http://www.w3.org/2000/01/rdf-schema#subClassOf> <http://ncicb.nci.nih.gov/xml/owl/EVS/Thesaurus.owl#C7057>.
?x1 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x0.
?x2 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x1.
?x3 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x2.
?x4 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x3.
?x5 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x4.
?x6 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x5.
?x7 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x6.
?x8 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x7.
?x9 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x8.
?x10 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x9
} } LIMIT 1
ontoportal (backend) - stop SELECT ?x0 ?x1 ?x2 ?x3 ?x4 ?x5 ?x6 ?x7 ?x8 ?x9 WHERE {
GRAPH <http://data.bioontology.org/ontologies/NCIT/submissions/1> {
?x0 <http://www.w3.org/2000/01/rdf-schema#subClassOf> <http://ncicb.nci.nih.gov/xml/owl/EVS/Thesaurus.owl#C7057>.
?x1 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x0.
?x2 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x1.
?x3 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x2.
?x4 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x3.
?x5 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x4.
?x6 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x5.
?x7 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x6.
?x8 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x7.
?x9 <http://www.w3.org/2000/01/rdf-schema#subClassOf> ?x8
} } LIMIT 1
Ah, it’s the path to root calculations. I think that’s worth a note to Franz to see what they think about the slowness.
addressed by moving metrics calculation to owlapi wrapper https://github.com/ncbo/ontologies_api/releases/tag/v5.24.0 https://github.com/ncbo/owlapi_wrapper/releases/tag/v1.4.0
When using AlegroGraph backend metrics calculations fail for a large number of ontologies. Processing hangs for 3 hours and eventually fails.