Open i4never opened 2 years ago
Change the slope of the learning rate decay; otherwise the learning rate drops suddenly right after warmup.
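For illustration, here is a minimal sketch of how such a drop can arise and how adjusting the decay slope removes it. The schedule shape (linear warmup followed by linear decay to zero) and the names `naive_lr`, `continuous_lr`, `peak_lr`, `warmup_steps`, and `total_steps` are assumptions made for this example; they are not taken from the code this change touches.

```python
def naive_lr(step, peak_lr, warmup_steps, total_steps):
    """Hypothetical schedule whose decay is measured from step 0.

    At step == warmup_steps the decay branch yields
    peak_lr * (1 - warmup_steps / total_steps), which is below peak_lr,
    so the rate jumps down as soon as warmup ends.
    """
    if step < warmup_steps:
        return peak_lr * step / warmup_steps
    return peak_lr * (1 - step / total_steps)


def continuous_lr(step, peak_lr, warmup_steps, total_steps):
    """Hypothetical schedule whose decay slope spans only the post-warmup steps.

    The decay starts exactly at peak_lr when step == warmup_steps and reaches
    zero at total_steps. Assumes total_steps > warmup_steps.
    """
    if step < warmup_steps:
        return peak_lr * step / warmup_steps
    remaining = max(total_steps - step, 0)
    return peak_lr * remaining / (total_steps - warmup_steps)


if __name__ == "__main__":
    # Compare the two schedules around the warmup boundary.
    for step in (99, 100, 101):
        print(step, naive_lr(step, 1.0, 100, 1000), continuous_lr(step, 1.0, 100, 1000))
```

In this sketch the only difference between the two functions is the denominator and origin of the decay term: dividing by `total_steps - warmup_steps` instead of `total_steps` makes the decay line pass through `peak_lr` at the end of warmup.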