udacity / deep-learning-v2-pytorch

Projects and exercises for the latest Deep Learning ND program https://www.udacity.com/course/deep-learning-nanodegree--nd101
MIT License
5.26k stars 5.32k forks source link

Character_Level_RNN_Solution uses future information to predict characters? #418

Open barimani opened 12 months ago

barimani commented 12 months ago

Under recurrent-neural-networks/char-rnn/Character_Level_RNN_Solution.ipynb:

The CharRNN design is using a LSTM layer that is uni-directional however with a fully connected on top of the output sequence, it seems like the information from future predictions are leaked back into the early predictions. The model is then used differently at inference time to make predictions. Is this intended or am I missing something?