LSTM network does not improve through training

andrewfry / SharpML-Recurrent

Neural network library (LSTM) based off of RecurrentJs

55 stars 16 forks source link

I replaced the FeedForward network in ExampleXor.cs with a similarly initialized Lstm network from NetworkBuilder. Running this, it can be seen that the network's error does not improve over time.

The Gru and Rnn networks appear to improve over time in the example project.

I am not too familiar yet with the intricacies of LSTMs, so I am taking my time decoding your implementation of GRU and comparing against some articles I am researching. I really appreciate the effort you've made here! It has helped me quite a lot in understanding neural networks and how the fancy maths translate into code. So, thank you! :)

andrewfry / SharpML-Recurrent

LSTM network does not improve through training #3