NELSONZHAO / zhihu

This repo contains the source code in my personal column (https://zhuanlan.zhihu.com/zhaoyeyu), implemented using Python 3.6. Including Natural Language Processing and Computer Vision projects, such as text generation, machine translation, deep convolution GAN and other actual combat code.
https://zhuanlan.zhihu.com/zhaoyeyu
3.5k stars 2.14k forks source link

word2vec的subsample问题 #28

Open yin-hong opened 5 years ago

yin-hong commented 5 years ago

你好!请问一下word2vec源代码中,subsample高频率单词(如the,a等),是在训练之前subsample吗?还是在训练过程中subsample? 我看作者的源代码好像是在训练过程中subsample,求解答