Open manwu1994 opened 3 years ago
a应该是论文中的scaling factor,也就是代码中的alpha,每层权重应该对应着同一个alpha,将每层的alpha提取出来就可以得到(-1,0,1)这样的权重了。 a is supposed to be the scaling factor in the paper, which is corresponded by alpha in codes. each layer has its own alpha, u can divide weights by alpha to get weights like (-1, 0, 1).
感谢回复!明白了!但是好像不是每一层对应同一个alpha,而是每一个神经元对应一个alpha,因此想问一下您有试过提取alpha参数吗?非常感谢!
感谢回复!明白了!但是好像不是每一层对应同一个alpha,而是每一个神经元对应一个alpha,因此想问一下您有试过提取alpha参数吗?非常感谢!
嗯嗯不好意思我理解有误,是一层对应一个alpha,那如果要获取(-1,0,1)的权重,还得提取delta的参数,对吧?谢谢您的时间!
Hi. I try to get the weight (-1,0,1) based on your code and the TWNs paper, I can't get the (-1,0,1) weight, but can get the (-a,0,a). Could you give me any suggestions? Thank you so much for your time.