CLARIAH / burgerLinker

Command line tool for linking civil registries
MIT License
13 stars 1 forks source link

process does not seem to stop when merging HDT files #13

Closed rlzijdeman closed 3 years ago

rlzijdeman commented 3 years ago

Having run the HDT merge function I get the following output after the HDT file is created: Generating header HDT file joint in: 3 min 24 sec 702 ms 490 us FINISHED: HDT merged and saved at derived//merged-dataset.hdt - Total runtime: 213.6 seconds (3.56 minutes) START: Generating HDT index... log4j:WARN No appenders could be found for logger (org.rdfhdt.hdt.triples.impl.PredicateIndexArray). log4j:WARN Please initialize the log4j system properly. log4j:WARN See http://logging.apache.org/log4j/1.2/faq.html#noconfig for more info. The process continues to run while displaying the above, so I think the process doesn't end properly?

raadjoe commented 3 years ago

These warnings are not relevant (just some warnings related to the use of libraries for logging). The code should be still running then, running the longest step of the process which is generating the HDT index file. There is no progress bar or logging when generating the index file, so nothing to do other than waiting for the 'finished message' :)

rlzijdeman commented 3 years ago

@rlzijdeman check again with --nohub

rlzijdeman commented 3 years ago

in the end this issue appeared trying to link only on a single observation, not relation.