ad-freiburg / qlever

Very fast SPARQL Engine, which can handle very large knowledge graphs like the complete Wikidata, offers context-sensitive autocompletion for SPARQL queries, and allows combination with text search. It's faster than engines like Blazegraph or Virtuoso, especially for queries involving large result sets.
Apache License 2.0
417 stars 52 forks source link

Add graph info to block metadata #1482

Closed joka921 closed 1 month ago

joka921 commented 2 months ago

If a block contains triples from at most MAX_NUM_GRAPHS_STORED_IN_BLOCK_METADATA (default value: 20) different graphs, store the set of these graphs along with the block metadata. Also store the binary information whether the block contains two identical triples belonging to different graphs. Given a set of graphs (which eventually will come from a query via the FROM or GRAPH clause) then use this metadata to determine whether a block contains relevant triples at all, or whether all the triples in the block are relevant, or whether the block needs to be filtered and or triples need to be deduplicated.

This is an index-breaking change (it changes the binary format of some of the index files).

codecov[bot] commented 1 month ago

Codecov Report

All modified and coverable lines are covered by tests :white_check_mark:

Project coverage is 94.33%. Comparing base (bed8b29) to head (c540e64). Report is 2 commits behind head on master.

Additional details and impacted files ```diff @@ Coverage Diff @@ ## master #1482 +/- ## ========================================== + Coverage 94.30% 94.33% +0.02% ========================================== Files 350 351 +1 Lines 25920 26052 +132 Branches 3486 3506 +20 ========================================== + Hits 24445 24577 +132 Misses 1433 1433 Partials 42 42 ```

:umbrella: View full report in Codecov by Sentry.
:loudspeaker: Have feedback on the report? Share it here.

sonarcloud[bot] commented 1 month ago

Quality Gate Passed Quality Gate passed

Issues
5 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarCloud