Open BeingGod opened 7 months ago
The usage and description of `loss-scale` are inconsistent. The `loss-scale` argument is described as expecting a positive power of 2, but `ConstantGradScaler` sets the scale to the given value directly rather than to `2**loss-scale`.
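To make the mismatch concrete, here is a minimal sketch. `ConstantScalerSketch`, `scale_as_used`, and `scale_if_exponent` are hypothetical names for illustration only, not the repository's actual code; the sketch assumes the scaler stores the argument unchanged, as described above.

```python
class ConstantScalerSketch:
    """Hypothetical stand-in for a constant grad scaler (not the real class)."""

    def __init__(self, loss_scale: float):
        # Assumption from the description above: the value is stored as-is,
        # with no exponentiation applied.
        self.scale = float(loss_scale)


def scale_as_used(loss_scale: float) -> float:
    """Scale that results when the argument is taken as the real scale."""
    return ConstantScalerSketch(loss_scale).scale


def scale_if_exponent(loss_scale: float) -> float:
    """Scale that would result if the argument were treated as an exponent of 2."""
    return 2.0 ** loss_scale


if __name__ == "__main__":
    value = 16
    print(scale_as_used(value))      # 16.0
    print(scale_if_exponent(value))  # 65536.0
```

With `loss-scale` set to 16, the two readings differ by a factor of 4096, so a user who follows one reading while the code follows the other ends up with a very different scale.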
Argument Description:
Argument Usage:
Could you help me review this PR? @tjruwase