Strategy of α and β decay during training

szagoruyko / attention-transfer

Improving Convolutional Networks via Attention Transfer (ICLR 2017)

1.44k stars 276 forks source link

Strategy of α and β decay during training #33

Open d-li14 opened 6 years ago

d-li14 commented 6 years ago

@szagoruyko @EderSantana Hi, your sharing code is appreciated, but would you please specify your strategy of decaying the two multipliers α and β during training process? Thanks in advance.