eclipse-rdf4j / rdf4j

Eclipse RDF4J: scalable RDF for Java
https://rdf4j.org/
BSD 3-Clause "New" or "Revised" License
365 stars 163 forks source link

Allow configuration of threshold for using SPARQL validation approach when validation sh:maxCount #5108

Closed hmottestad closed 3 months ago

hmottestad commented 3 months ago

When validation sh:maxCount there are two possible validation approaches for bulk validation. The first option is to generate a SPARQL query that returns all the invalid data, the second is to use the transactional validation approach with fallback bulk validation. This second option uses a lot of memory and is typically very slow, but at some point the SPARQL query actually becomes even slower. This point is set to 5 in the code, but for performance testing and optimisation it would be useful to change this using a java system property.