stanfordnlp / GloVe

Software in C and data files for the popular GloVe model for distributed word representations, a.k.a. word vectors or embeddings
Apache License 2.0
6.86k stars 1.51k forks source link

why sort when calculating cooccur? #38

Closed PengFoo closed 8 years ago

PengFoo commented 8 years ago

i noticed that there are several qsort in the cooccur.c, while next step is shuffling the cooccur. i wonder what the sort code is used to.

ghost commented 8 years ago

I think this just establishes a canonical ordering for convenience. I've been a little slow to reply as I can't be 100% certain this is the case :). Sorting is not mentioned in the paper.