dbpedia / lookup

Outputs a list of ranked DBpedia resources for a search string.
Apache License 2.0
185 stars 39 forks source link

Rebuilding the index #12

Closed samhumeau closed 8 years ago

samhumeau commented 9 years ago

Hello everyone,

I am currently rebuilding the index, in order to have the latest data from dbpedia.

I am stuck at the part: "pignlproc output from nerd-stats.pig"

At first, just to know what it is about, what is this file doing? Can the look-up work without this file?

Then, is the nerd-stats.pig a very intensive script that requires a hadoop cluster of several machines? Or is it ok to run it on a laptop?

Thanks you by advance, Sam

sandroacoelho commented 8 years ago

Hi @Cerbal

There are two forms to do it. If you don't feel comfortable with Apache Pig, I recommend you use run Indexer with DBpedia data. Maybe you would like to try this automated script

Best,