ad-freiburg / qlever

Very fast SPARQL Engine, which can handle very large knowledge graphs like the complete Wikidata, offers context-sensitive autocompletion for SPARQL queries, and allows combination with text search. It's faster than engines like Blazegraph or Virtuoso, especially for queries involving large result sets.
Apache License 2.0
424 stars 52 forks source link

confusing messages about "parallel-parsing" : true setting #1600

Open pfps opened 2 weeks ago

pfps commented 2 weeks ago

I'm trying to load Wikidata using the wikidata.settings.json file in the current version of QLever. I'm seeing some confusing messages when the load starts:

2024-11-05 08:46:58.294 - INFO: QLever IndexBuilder, compiled on Mon Nov  4 07:47:39 PM EST 2024 using git hash e52848
2024-11-05 08:46:58.296 - INFO: You specified "locale = en_US" and "ignore-punctuation = 1"
2024-11-05 08:46:58.296 - INFO: You specified "ascii-prefixes-only = true", which enables faster parsing for well-behaved TTL files
2024-11-05 08:46:58.296 - INFO: You specified "parallel-parsing = true", which enables faster parsing for TTL files with a well-behaved use of newlines
2024-11-05 08:46:58.296 - INFO: You specified "num-triples-per-batch = 10,000,000", choose a lower value if the index builder runs out of memory
2024-11-05 08:46:58.296 - INFO: By default, integers that cannot be represented by QLever will throw an exception
2024-11-05 08:46:58.296 - INFO: Processing triples from /dev/stdin ...
2024-11-05 08:46:58.296 - WARN: Parallel parsing set to `true` in the `.settings.json` file; this is deprecated, please use the command-line  option --parse-parallel or -p instead
2024-11-05 08:46:58.296 - INFO: Parsing input triples and creating partial vocabularies, one per batch ...
2024-11-05 09:00:02.723 - INFO: Triples parsed: 1,650,000,000 [average speed 2.1 M/s, last batch 2.0 M/s, fastest 3.4 M/s, slowest 1.9 M/s] 

Should the settings file be changed? In any case, it seems weird to have QLever produce a message about the setting that kind of indicates that the setting works and later say that it won't work.