In this post it is mentioned one heuristic to set learning rate is to look at the ratio between the weight update norms and the weight norms and try to adjust the learning rate such that this is about 1/1000. Weight norms are already monitored by default by pylearn2 but we have no way of monitoring the gradient updates at the moment. It would be useful to add monitoring channels for this purpose.
In this post it is mentioned one heuristic to set learning rate is to look at the ratio between the weight update norms and the weight norms and try to adjust the learning rate such that this is about 1/1000. Weight norms are already monitored by default by
pylearn2
but we have no way of monitoring the gradient updates at the moment. It would be useful to add monitoring channels for this purpose.