ybisk / HMM-RNN

2 stars 0 forks source link

PTB LM setup #1

Closed janmbuys closed 6 years ago

janmbuys commented 6 years ago
janmbuys commented 6 years ago

Update: Implemented the above (as ptb_main.py).

However, we still need to implement standard regularization and optimization techniques to get reasonable results:

janmbuys commented 6 years ago

Note that LSTM regularization techniques might not work for HMM as due to the hidden state bottleneck it will have a harder time memorizing the training data.

ybisk commented 6 years ago

I hope there's an error in the perplexity?

image

janmbuys commented 6 years ago

Update: Implemented dropout and basic SGD optimization strategies.