Remper / bk2vec


bk2vec

Injecting background knowledge into the word vectors

How to train word vectors

How to extract texts from the Wikipedia dump

java -cp thewikimachine.jar org.fbk.cit.hlt.thewikimachine.xmldump.WikipediaTextExtractor \
    -d <path-to-dump.xml> \
    -o <path-to-output-directory> \
    -t <number-of-threads>
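
If the extraction step needs to be scripted, the command above can be wrapped in a small Python helper. This is only a sketch: the function names `build_extract_command` and `extract_texts` are hypothetical, it assumes `thewikimachine.jar` sits in the working directory, and it uses only the `-d`, `-o` and `-t` options shown above. Defaulting the thread count to the number of CPU cores and creating the output directory in advance are assumptions made for convenience, not documented requirements of the extractor.

```python
import os
import subprocess
from pathlib import Path

# Main class taken from the command shown above.
MAIN_CLASS = "org.fbk.cit.hlt.thewikimachine.xmldump.WikipediaTextExtractor"


def build_extract_command(dump_path, output_dir, threads=None, jar="thewikimachine.jar"):
    """Build the argument list for the extractor (hypothetical helper)."""
    if threads is None:
        # Assumption: one thread per CPU core is a reasonable default.
        threads = os.cpu_count() or 1
    return [
        "java", "-cp", str(jar), MAIN_CLASS,
        "-d", str(dump_path),
        "-o", str(output_dir),
        "-t", str(threads),
    ]


def extract_texts(dump_path, output_dir, threads=None, jar="thewikimachine.jar"):
    """Run the extractor and raise CalledProcessError if it exits with an error."""
    # Assumption: creating the output directory beforehand is harmless.
    Path(output_dir).mkdir(parents=True, exist_ok=True)
    cmd = build_extract_command(dump_path, output_dir, threads, jar)
    subprocess.run(cmd, check=True)
```

For example, `extract_texts("dump.xml", "extracted", threads=4)` would run the same command as above with four threads. Passing the arguments as a list rather than a single shell string avoids quoting problems when paths contain spaces.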