Faster Recurrent Neural Network Language Modeling Toolkit with Noise Contrastive Estimation and Hierarchical Softmax
561
stars
138
forks
source link
Entropy is nan when a vocabulary larger than 2 million words is used #47
Open
fanduo12138 opened 6 years ago