Closed makrai closed 8 years ago
@makrai Nincsen jogtiszta MatLab-om, sajnos a matek intézetnek lejártak a licenszei
megpróbálnád octave-val?
@DavidNemeskey in the paper, write about both the orig Huang et al and CMultiVec
We had a look at the code, and found that
The word representations use a dictionary of 100,232 words. 10 prototypes are used for
6,162 of the words, which roughly correspond to the most frequent words. To determine
which prototype to use given context, run run.m in matlab.
I am writing a mail to Huang to ask him how they arrived to those words.
Would you please also ask how they clustered the occurrences (to compute the prototypes)?
to write in the paper
No reply from Huang thus far, but I think it is not that important anyway. Once @kornai is done with the paper for today, I'll add those two sentences about this MSE.
http://www.socher.org/index.php/Main/ImprovingWordRepresentationsViaGlobalContextAndMultipleWordPrototypes