clab / lstm-parser

Transition-based dependency parser based on stack LSTMs
Apache License 2.0
204 stars 63 forks source link

Read zipped word vectors #4

Open danielhers opened 8 years ago

danielhers commented 8 years ago

Currently the -w option allows specifying a word embedding file in text. However, these files may be quite large and are often saved in .gz format, for example. When given a compressed word embedding file, the program should still read it, unzipping it first.

HenrikLovold commented 7 years ago

@danielhers When I run word2vec to generate embeddings, I have the option to save the vectors in binary mode. Does this parser take binary mode vectors, or do I have to save them non-binary?

danielhers commented 7 years ago

@henrik2706 the vectors should be saved as text, just like the pre-trained ones given with the parser: https://drive.google.com/file/d/0B8nESzOdPhLsdWF2S1Ayb1RkTXc/view?usp=sharing