MS20190155 / Measuring-Corporate-Culture-Using-Machine-Learning

Code Repository for MS20190155
141 stars 97 forks source link

Confusion about the dictionary #9

Open JJery-web opened 1 year ago

JJery-web commented 1 year ago

Thanks to this project for the opportunity to learn. I encountered 2 confusions while building the dictionary and would like answers. Thanks.

  1. I set N_WORDS_DIM=500, but the final number of words I get in expanded_dict.csv is 520. Why is the final number of words not equal to 500?

2, I set N_WORDS_DIM=300 and 500 respectively, and found that the first 300 words of the two results are not exactly the same (I think if the seed words are not changed, the first 300 words should theoretically be exactly the same based on the cos method to rank) I am also confused by this question.

I look forward to receiving a reply! Thanks!