openai / generating-reviews-discovering-sentiment

Code for "Learning to Generate Reviews and Discovering Sentiment"
https://arxiv.org/abs/1704.01444
MIT License

hyperparameters #28

Closed: yairf11 closed this issue 7 years ago

yairf11 commented 7 years ago

Hi, I'm a little confused about the hyperparameters of the model. In the paper, it is stated that:

The model was trained for a single epoch on mini-batches of 128 subsequences of length 256 for a total of 1 million weight updates.

But if I understand correctly, the hyperparameter 'nsteps' represents the length of each subsequence, and it is set to 64, not 256. Why is that? Am I understanding the meaning of 'nsteps' correctly? Also, I couldn't figure out what the 'nembd' and 'nstates' hyperparameters stand for. It would be great if someone could clear this up for me.
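(For reference, a quick arithmetic check of what the quoted training setup implies; this sketch is illustrative, not from the repo, and assumes non-overlapping subsequences.)

```python
# Sanity check of the training figures quoted from the paper.
batch_size = 128       # subsequences per mini-batch
subseq_len = 256       # bytes per subsequence (training-time length, not nsteps)
updates = 1_000_000    # total weight updates

bytes_per_update = batch_size * subseq_len   # 32,768 bytes per update
total_bytes = bytes_per_update * updates     # ~3.28e10 bytes, roughly 33 GB
print(f"{total_bytes:.3e} bytes processed")  # a single pass over the corpus
```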

Thanks!

Newmu commented 7 years ago

Hi yair,

The setting nsteps is only used here for test-time feature extraction: it's how many steps are processed on the GPU per sess.run call. It can be set to any length and has no impact on the computed result. Since this implementation pads inputs up to that length, smaller values are more efficient because they reduce overall padding, but if you go too small you lose efficiency to the overhead of many sess.run calls and feed dicts. I just picked 64 as a good middle ground.
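A minimal sketch of the chunk-and-pad trade-off being described (illustrative Python, not the repo's code; the helper name and byte string are made up):

```python
import numpy as np

nsteps = 64  # fixed chunk length processed per sess.run call

def chunk_and_pad(byte_seq, nsteps):
    """Split a byte sequence into nsteps-sized chunks, zero-padding the last one."""
    chunks = []
    for start in range(0, len(byte_seq), nsteps):
        chunk = byte_seq[start:start + nsteps]
        padded = np.zeros(nsteps, dtype=np.int32)
        padded[:len(chunk)] = list(chunk)
        chunks.append(padded)
    return chunks  # each chunk would be one sess.run call

# A 100-byte review -> two chunks: 64 real bytes, then 36 real + 28 padding.
# With nsteps=256 the same review would waste 156 padded positions in one call.
review = b"x" * 100
print([int((c != 0).sum()) for c in chunk_and_pad(review, nsteps)])  # [64, 36]
```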

nstates lets the code know that the LSTM has two separate state vectors (cell and hidden). nembd is the dimensionality of the embedding layer (64) used to represent the input bytes.
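A small illustrative sketch of what those two hyperparameters describe (a numpy stand-in, not the repo's TensorFlow code; shapes are only indicative, and nhidden = 4096 is the hidden size reported in the paper):

```python
import numpy as np

nvocab = 256     # input alphabet: raw bytes 0..255
nembd = 64       # embedding dimensionality (the repo's nembd)
nhidden = 4096   # hidden size of the paper's mLSTM (assumption from the paper)
nstates = 2      # the LSTM carries two state vectors per example: cell and hidden

# Embedding table: one nembd-dim vector per possible byte value (random here).
embed = np.random.randn(nvocab, nembd)
byte_ids = np.frombuffer(b"great!", dtype=np.uint8)
x = embed[byte_ids]                        # shape (6, 64): embedded input bytes

# The recurrent state carried between sess.run calls: nstates vectors
# per example, which can be stored as one (nstates * nhidden) block.
state = np.zeros((1, nstates * nhidden))   # batch of 1: cell and hidden concatenated
```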