Open myeghaneh opened 3 years ago
any idea? :)
I am facing the same issue. Let me know if you've been able to solve it.
To run scripts/03_glove_build_counts.py successfully, do the following and pass the correct build folder of GloVe:
1. Add the GloVe repo (git submodule add https://github.com/stanfordnlp/GloVe.git).
2. Run cd GloVe && make, which creates a GloVe/build directory. Then go back to the parent directory (cd ..).
3. When running 03_glove_build_counts.py, pass the build folder GloVe/build instead of the GloVe directory path, as follows: python scripts/03_glove_build_counts.py GloVe/build source_folder output_folder
This is basically described in the comments on lines 20-28 of the script 03_glove_build_counts.py.
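Before running step 3, it can help to check that the folder you are about to pass really is a GloVe build folder. The snippet below is only a minimal sketch: missing_glove_tools is a hypothetical helper, not part of sense2vec or GloVe, and it assumes that make produces the four tools vocab_count, cooccur, shuffle and glove, possibly with a .exe suffix on Windows.

```python
from pathlib import Path

# Tools that `make` in the GloVe repo is assumed to place in GloVe/build.
EXPECTED_TOOLS = ("vocab_count", "cooccur", "shuffle", "glove")


def missing_glove_tools(build_dir):
    """Return the expected GloVe tool names that are not files in build_dir.

    Hypothetical helper for illustration; not part of sense2vec or GloVe.
    """
    build = Path(build_dir)
    missing = []
    for name in EXPECTED_TOOLS:
        # Accept the bare name or a .exe suffix (assumption for Windows builds).
        candidates = (build / name, build / (name + ".exe"))
        if not any(path.is_file() for path in candidates):
            missing.append(name)
    return missing


if __name__ == "__main__":
    import sys

    # Default to the folder name used in the steps above.
    build_dir = sys.argv[1] if len(sys.argv) > 1 else "GloVe/build"
    missing = missing_glove_tools(build_dir)
    if missing:
        print(f"{build_dir} is missing: {', '.join(missing)}")
    else:
        print(f"{build_dir} looks like a GloVe build folder")
```

If the check reports missing tools, the path is probably not the output of make. For example, a path such as glove.6B.200d.txt/vocab_count points into the pre-trained vectors download rather than GloVe/build.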
I followed your steps to train my own S2V on my corpus with my customized NER model. Up to step 2 everything is fine:
corpusMODELV05.spacy is created, and so is corpusMODELV05-1.s2v.
But in step 3 I ran into this error.
I am working on a Windows 10 machine and have used this version of GloVe:
Wikipedia 2014 + Gigaword 5 (6B tokens, 400K vocab, uncased, 50d, 100d, 200d, & 300d vectors, 822 MB download): glove.6B.zip
https://nlp.stanford.edu/projects/glove/
It seems the vocabulary size in
glove.6B.200d.txt/vocab_count is not in line with something.
Can someone help me?
Many thanks in advance.