rlebret / hpca

C++ implementation of the Hellinger PCA for computing word embeddings.
GNU General Public License v3.0
32 stars 5 forks source link

co-occurrence matrix #1

Open HaniehP opened 8 years ago

HaniehP commented 8 years ago

Hi, Thanks for your nice code. Could you also please upload the co-occurrence matrix? It is not provided here http://lebret.ch/words/

Thanks

rlebret commented 8 years ago

The co-occurrence matrix is computed with the 'cooccurrence' tool. This tool needs a corpus text file as input, along with the vocabulary file (which is computed with the 'vocab' tool).