Open zorrocai opened 5 years ago
I think the scale parameter should be multiplied duiring updating center. Since the scale is a constant to weight center loss, it should influence both data sample and center backward.
如果我想实现mxnet版本多卡的center_loss operator,请问要怎么改这个代码? @zorrocai
这个就是多卡的吧
I think the scale parameter should be multiplied duiring updating center. Since the scale is a constant to weight center loss, it should influence both data sample and center backward.