accosmin closed this issue 8 years ago
The weighting scheme does not match the one introduced in the original paper.
Consider tuning the momentum factor for the "averaging" methods (adadelta, adagrad, sia, sga).
Candidate momentum values to try: {0.1, 0.2, 0.5, 0.9, 0.95}.
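
A rough sketch of what such a sweep could look like, not the library's actual tuning code. It assumes the momentum factor is the weight of an exponential moving average over the iterates, and it uses a toy noisy quadratic in place of a real training loss. The names `averaged_sgd` and `tune_momentum` are hypothetical.

```python
import random

# Candidate values taken from the list above.
MOMENTA = [0.1, 0.2, 0.5, 0.9, 0.95]


def averaged_sgd(momentum, steps=200, lr=0.1, seed=0):
    """Run SGD on f(x) = 0.5 * x^2 with noisy gradients.

    Returns f evaluated at the exponentially averaged iterate.
    Assumption: "momentum" weights the running average of iterates.
    """
    rng = random.Random(seed)  # fixed seed so every momentum sees the same noise
    x = 5.0
    x_avg = x
    for _ in range(steps):
        grad = x + rng.gauss(0.0, 1.0)  # exact gradient plus Gaussian noise
        x -= lr * grad
        x_avg = momentum * x_avg + (1.0 - momentum) * x
    return 0.5 * x_avg ** 2


def tune_momentum(momenta=MOMENTA):
    """Evaluate each candidate and return (best momentum, all losses)."""
    losses = {m: averaged_sgd(m) for m in momenta}
    best = min(losses, key=losses.get)
    return best, losses


if __name__ == "__main__":
    best, losses = tune_momentum()
    for m, loss in sorted(losses.items()):
        print(f"momentum={m:<5} loss={loss:.6f}")
    print("best momentum:", best)
```

In practice the toy objective would be swapped for the real training criterion, and the sweep repeated over several seeds before picking a default.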