yandex / faster-rnnlm

Faster Recurrent Neural Network Language Modeling Toolkit with Noise Contrastive Estimation and Hierarchical Softmax
Other
561 stars 138 forks source link

Probably bug in valid/text entropies in doc/RESULTS.md #52

Open drtonyr opened 4 years ago

drtonyr commented 4 years ago

doc/RESULTS.md contains "Validation entropy" and "Test entropy" for a range of experiments on the "One Billion Word Benchmark"

The "Validation entropy" number is exactly the same as the "Test entropy" number, so it looks as though the same dataset was used to compute both (they should be news.en.heldout-00000-of-00050 and news.en.heldout-00001-of-00050 respectfully).