Open ZhouHaoWa opened 1 year ago
Because 1000 can create an isotropic
Could you please tell me what "isotropic" means? Thanks!
It just makes gradient descent faster.
It's common to scale the loss by a constant factor to control the magnitude of gradients during training. Large gradients can lead to unstable training, so scaling the loss down can help stabilize the optimization process.
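To make the point above concrete, here is a minimal sketch in plain Python rather than the repository's actual training code. The toy loss, the data, and the helper names `grad_sum_loss` and `sgd_step` are all made up for illustration. It shows that dividing a summed loss by a constant divides the gradient by that constant, so with plain SGD it acts like a smaller learning rate. Adaptive optimizers may behave differently.

```python
# Toy stand-in for a summed loss: sum((w * x - y)^2) over all samples.
# This is NOT the trainer from the repository, only an illustration.
def grad_sum_loss(w, xs, ys):
    # Analytic gradient of the summed squared error with respect to w.
    return sum(2.0 * (w * x - y) * x for x, y in zip(xs, ys))


def sgd_step(w, xs, ys, lr, loss_scale=1.0):
    # Gradient of (loss / loss_scale) is (gradient of loss) / loss_scale.
    g = grad_sum_loss(w, xs, ys) / loss_scale
    return w - lr * g, g


xs = [float(i) for i in range(1, 11)]
ys = [3.0 * x for x in xs]  # the minimizer is w = 3
w0 = 0.0

# One SGD step on the raw summed loss.
w_raw, g_raw = sgd_step(w0, xs, ys, lr=1e-2)
# Same step with the loss divided by 1000.
w_scaled, g_scaled = sgd_step(w0, xs, ys, lr=1e-2, loss_scale=1000.0)
# Raw loss, but with a learning rate 1000 times smaller.
w_small_lr, _ = sgd_step(w0, xs, ys, lr=1e-5)

print("gradient, raw loss:     ", g_raw)
print("gradient, loss / 1000:  ", g_scaled)
print("w after step, raw:      ", w_raw)
print("w after step, scaled:   ", w_scaled)
print("w after step, small lr: ", w_small_lr)
```

In this toy setup the raw summed loss overshoots the minimizer in a single step, while the scaled loss takes a small step toward it. The scaled step matches the small-learning-rate step, which suggests the constant is largely a matter of convenience when the loss is a sum over many elements.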
In loss = trainer(x_0).sum() / 1000., why is the divisor 1000?