Open danielhers opened 8 years ago
@danielhers When I run word2vec to generate embeddings, I have the option to save the vectors in binary mode. Does this parser take binary mode vectors, or do I have to save them non-binary?
@henrik2706 the vectors should be saved as text, just like the pre-trained ones given with the parser: https://drive.google.com/file/d/0B8nESzOdPhLsdWF2S1Ayb1RkTXc/view?usp=sharing
Currently the
-w
option allows specifying a word embedding file in text. However, these files may be quite large and are often saved in.gz
format, for example. When given a compressed word embedding file, the program should still read it, unzipping it first.