Open JustinAsdz opened 3 years ago
I can't figure it out why all values will converge to 2 ( the hyper-parameter γ).
Do you mean using different γ like 1, 2, 4, 8? or all the value will be 2 at the end of the training process?
I can't figure it out why all values will converge to 2 ( the hyper-parameter γ).