wojciechz / learning_to_execute

Learning to Execute
Apache License 2.0
480 stars, 115 forks

Added dropout, replaced sgd with adam optimizer #4

Closed: ghost closed this 8 years ago

ghost commented 9 years ago

In my experiments, these changes seem to speed up training, measured in number of epochs rather than wall-clock time.

Please note that I have only tested this without :cuda() / cutorch.
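For readers unfamiliar with the two changes named in the title, here is a minimal, illustrative sketch in plain Python. It is not the code from this pull request (the repository itself is written in Lua/Torch). It only contrasts a plain SGD step with an Adam step and shows inverted dropout. The function names (`sgd_step`, `adam_step`, `dropout`, `new_adam_state`) and all hyperparameter values are assumptions chosen for illustration, using commonly cited Adam defaults rather than anything stated in the PR.

```python
import math
import random

def sgd_step(params, grads, lr=0.1):
    """Plain SGD: move each parameter against its gradient."""
    return [p - lr * g for p, g in zip(params, grads)]

def new_adam_state(n):
    """Hypothetical helper: step counter plus first/second moment estimates."""
    return {"t": 0, "m": [0.0] * n, "v": [0.0] * n}

def adam_step(params, grads, state, lr=0.001, beta1=0.9, beta2=0.999, eps=1e-8):
    """One Adam update; hyperparameters are assumed defaults, not from the PR."""
    state["t"] += 1
    t = state["t"]
    new_params = []
    for i, (p, g) in enumerate(zip(params, grads)):
        # Exponential moving averages of the gradient and squared gradient.
        state["m"][i] = beta1 * state["m"][i] + (1 - beta1) * g
        state["v"][i] = beta2 * state["v"][i] + (1 - beta2) * g * g
        # Bias correction for the zero-initialised averages.
        m_hat = state["m"][i] / (1 - beta1 ** t)
        v_hat = state["v"][i] / (1 - beta2 ** t)
        new_params.append(p - lr * m_hat / (math.sqrt(v_hat) + eps))
    return new_params

def dropout(values, p=0.5, rng=random):
    """Inverted dropout: zero each value with probability p, rescale survivors."""
    keep = 1.0 - p
    return [v / keep if rng.random() < keep else 0.0 for v in values]
```

Adam scales each parameter's step by running estimates of the gradient's mean and variance, which is one plausible reason training could need fewer epochs than with a fixed SGD learning rate. Whether it also helps wall-clock time depends on the per-step cost, which this sketch does not measure.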