Closed karandeep-johar closed 5 years ago
I'll look into this, but that code is old and discouraged: the last time the corpus dataset was built was maybe 8 years ago, and that dataset no longer matches the current dictionaries, and is unlikely to yield accurate rankings. Of course, you could build a new dataset, but this would take a lot of work, and the results might not be worth the effort. If you want to do research, then yes, do this, and I would encourage that. If you merely want to use it for something, then ... no.
Flip-side is that a regenerated database might help with opencog issues, for example, opencog/opencog#2193 -- parse ranking of "Bob lives in China" (although that issue is worked around in commit 6780d33c45653551f582600bb57272416b4dbae6 )
Closing, as this code has been removed (from 5.7.0).
I followed the instructions on the page which asks to run
./configure --enable-corpus-stats
and then make which gives me the errors.I tried to remove them by including the prerequisite files and also by changing the definition of
double lg_corpus_disjunct_score(Linkage linkage, int w)
todouble lg_corpus_disjunct_score(Linkage linkage, WordIdx w)
.I managed to compile it. But when I tried to run it. I got a seg fault. Can you please look into the issue?