ybisk / HMM-RNN

PTB LM setup #1

Closed. janmbuys closed this issue 6 years ago.

janmbuys commented 6 years ago

Load PTB data.
Compute perplexity on training and validation data.
Train with truncated backpropagation through time.
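
For reference, a minimal sketch of the three steps above on a plain list of token ids. The helper names (`batchify`, `bptt_windows`, `perplexity`) and the window layout are assumptions for illustration, not the code in this repository; the model itself is left out, so only the data slicing and the perplexity formula are shown.

```python
import math


def batchify(token_ids, batch_size):
    """Split one long token stream into `batch_size` parallel streams.

    Trailing tokens that do not fill a whole stream are dropped.
    """
    stream_len = len(token_ids) // batch_size
    return [token_ids[i * stream_len:(i + 1) * stream_len]
            for i in range(batch_size)]


def bptt_windows(streams, bptt):
    """Yield (inputs, targets) windows of at most `bptt` time steps.

    Targets are the inputs shifted by one token. In truncated BPTT the
    hidden state is carried from one window to the next, but gradients
    are not propagated across the window boundary.
    """
    stream_len = len(streams[0])
    for start in range(0, stream_len - 1, bptt):
        end = min(start + bptt, stream_len - 1)
        inputs = [s[start:end] for s in streams]
        targets = [s[start + 1:end + 1] for s in streams]
        yield inputs, targets


def perplexity(total_nll, num_tokens):
    """Perplexity = exp(mean per-token negative log-likelihood, natural log)."""
    return math.exp(total_nll / num_tokens)


if __name__ == "__main__":
    streams = batchify(list(range(10)), batch_size=2)
    for inputs, targets in bptt_windows(streams, bptt=3):
        print(inputs, targets)
```

As a sanity check on the metric, a model that assigns uniform probability over a vocabulary of size V has perplexity exactly V, so a reported perplexity far above the vocabulary size usually points to a bug in the loss or the token count.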

janmbuys commented 6 years ago

Update: Implemented the above (as ptb_main.py).

However, we still need to implement standard regularization and optimization techniques to get reasonable results:

First, implement Zaremba et al. (2014).
Later, consider porting AWD-LM.

janmbuys commented 6 years ago

Note that LSTM regularization techniques might not work for the HMM: because of its hidden-state bottleneck, it will have a harder time memorizing the training data.

ybisk commented 6 years ago

I hope there's an error in the perplexity?

janmbuys commented 6 years ago

Update: Implemented dropout and basic SGD optimization strategies.
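
A rough sketch of what dropout plus basic SGD strategies can look like, written in plain Python so it runs without a framework. The function names, the choice of gradient-norm clipping and stepwise learning-rate decay, and all numeric values are assumptions for illustration; they are not taken from ptb_main.py.

```python
import math
import random


def dropout(values, p, rng):
    """Inverted dropout: zero each value with probability p, scale the rest by 1/(1-p)."""
    keep = 1.0 - p
    return [v / keep if rng.random() < keep else 0.0 for v in values]


def clip_by_global_norm(grads, max_norm):
    """Rescale the gradient vector so its L2 norm is at most max_norm."""
    norm = math.sqrt(sum(g * g for g in grads))
    if norm <= max_norm or norm == 0.0:
        return list(grads)
    scale = max_norm / norm
    return [g * scale for g in grads]


def decayed_lr(base_lr, epoch, decay_start, decay):
    """Hold base_lr for epochs 0..decay_start-1, then multiply by `decay` each epoch."""
    return base_lr * decay ** max(0, epoch - decay_start + 1)


def sgd_step(params, grads, lr):
    """Plain SGD update: p <- p - lr * g."""
    return [p - lr * g for p, g in zip(params, grads)]


if __name__ == "__main__":
    rng = random.Random(0)
    params = [0.5, -0.5]
    grads = clip_by_global_norm([3.0, 4.0], max_norm=1.0)  # hypothetical values
    lr = decayed_lr(base_lr=1.0, epoch=5, decay_start=4, decay=0.5)
    print(sgd_step(params, grads, lr))
    print(dropout([1.0, 2.0, 3.0, 4.0], p=0.5, rng=rng))
```

Dropout should be switched off (p = 0) when computing validation perplexity, otherwise the reported number is inflated by the noise.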