In visualize.py file,
I see the learning_rate is 10000,it confused me!why doing this,
also,In the optimization procedure,The sentence "caffe_data = caffe_data + learning_rate*diff " seems not used any L2 Paradigm,but in the paper, It is not like this!
thanks a lot!
In visualize.py file, I see the learning_rate is 10000,it confused me!why doing this, also,In the optimization procedure,The sentence "caffe_data = caffe_data + learning_rate*diff " seems not used any L2 Paradigm,but in the paper, It is not like this! thanks a lot!