Open moofin2017 opened 7 years ago
It looks like L-BFGS took a bad step and was unable to recover. Unfortunately my L-BFGS implementation does not include a line search to guard against and reject bad steps. I very rarely saw them in practice. The only answer I can really give with the current implementation is to use Adam instead for these inputs.
Seem to have run into a bug where the loss exploded. See below, step 337:
Using Ubuntu 16.04, Python3.6, CUDNN6.1, CUDA8.0, MKL, GPU.