stanfordnlp / GloVe

Software in C and data files for the popular GloVe model for distributed word representations, a.k.a. word vectors or embeddings
Apache License 2.0

How to train my own vectors if the corpus is 70 GB? #201

Closed by JiaoPaner 2 years ago

JiaoPaner commented 2 years ago

Does the corpus data need to be loaded into memory all at once?