Yonghongwei / Gradient-Centralization

A New Optimization Technique for Deep Neural Networks
533 stars 80 forks source link

Should i use pytorch gradient clippping with gradient centralization? #5

Closed hadaev8 closed 4 years ago

Yonghongwei commented 4 years ago

Yes, you can use Gradient Clipping before GC. It still works.

hadaev8 commented 4 years ago

Thx