Closed accuracy-maker closed 1 year ago
Shuffling is for the batches and not for the actual timesteps. For example you have two batches of your time series and let's say the input of the model has to be of size [n_batches, timesteps](No feature dimension for simplicity). What shuffling will do is shuffle the order that the model receives the batches and not the actual timesteps. Lets say we have two batches with 5 timesteps [[1,2,3,4,5],[0,-1,-2,-3,-4]] (n_batches=2,timesteps=5). We can shuffle the order of the batches and have [[0,-1,-2,-3,-4],[1,2,3,4,5]], but we are not shuffling the actual timesteps [[2,5,3,1,4],[-3,0,-2,-1,-4]] because that would hurt the model. This technique can help reduce overfitting.
Thank you for answer this, I understand it!
Thanks for your valuable explanation! @ChuckTG
well done~
Thank you for sharing your great work. I notice the parameter of training shuffle is True. I think it will break the order of time series. I don't understand