cemoody / lda2vec

MIT License
3.15k stars 627 forks source link

Fix incompatibility with spacy>=0.100 #11

Closed abelsonlive closed 8 years ago

abelsonlive commented 8 years ago

See: https://github.com/cemoody/lda2vec/issues/10

syllog1sm commented 8 years ago

I hope this fixes the issue for you, and lets you update.

Version v0.100 includes much faster loading time, and makes it easy to load the GloVe vectors, instead of the Wikipedia-trained ones we've been shipping by default.

To install the GloVe vectors:

$ sputnik —name spacy install en
$ sputnik --name spacy install en_glove_cc_300_1m_vectors

Then in Python, load the en model with the GloVe vectors like so:

import spacy

nlp = spacy.load('en', vectors='en_glove_cc_300_1m_vectors')

You can also call sputnik from within Python, if you prefer to have your program install the data resources on first load, instead of requiring explicit setup.

cemoody commented 8 years ago

My apologies for neglecting this PR, I just now noticed! I'll take a look.

cemoody commented 8 years ago

Thanks for the PR! I don't think this is necessary any more since the DATA_DIR was removed anyhow. Thank you though!