Open anjishnu opened 9 years ago
I was trying to train a model on a fairly large corpus, 6GB for text (0.98% training, 0.02% validation)
Luckily this Pull Request:
Solved most of my data problems - the only issue arrived during the periodic validations that were running caused my training to error out.
I think the eval split function needs to be modified as well to support chunked evaluation. https://github.com/karpathy/char-rnn/blob/master/train.lua#L186
I'm new to Lua so I still haven't been able to figure out which part of the code is the memory bottleneck in this function.
Hmmm, two things:
I was trying to train a model on a fairly large corpus, 6GB for text (0.98% training, 0.02% validation)
Luckily this Pull Request:
Handle large input data files by reading smaller chunks*
Solved most of my data problems - the only issue arrived during the periodic validations that were running caused my training to error out.
I think the eval split function needs to be modified as well to support chunked evaluation. https://github.com/karpathy/char-rnn/blob/master/train.lua#L186
I'm new to Lua so I still haven't been able to figure out which part of the code is the memory bottleneck in this function.