Closed horiacristescu closed 6 years ago
There is no compatibility between .bin models and gensim. The functionalities to generate the .vec are not hard to add but you would get only unigrams. Without modifying the source code, you can generate a .vec file by first using a command such as:
./fasttext print-sentence-vectors model.bin < vocabulary.txt
And then merging the output with the tokens in vocabulary.
Here again you cannot get bigram embeddings.
There is no compatibility between .bin models and gensim. The functionalities to generate the .vec are not hard to add but you would get only unigrams. Without modifying the source code, you can generate a .vec file by first using a command such as:
./fasttext print-sentence-vectors model.bin < vocabulary.txt
And then merging the output with the tokens in vocabulary.
Here again you cannot get bigram embeddings.
Thanks for this , how to download both biprams and unigrams ?
I am trying to load the ".bin" model file in gensim (v3.3.0) from sent2vec, but I get this error:
I looked for the plain text format (.vec) of the model but I can't find it, I presume sent2vec doesn't generate it.
I also tried "./fasttext print-word-vectors model.bin" but it just hangs.
How can I use the vectors in gensim?