Closed zeyu-liu closed 4 years ago
Why use an independent meta networks to generate the weights for each block, in stead of using the weights of the original networks and update them directly?
Because the accuracy is low due to the excessive weight sharing. You can refer to figure.8 in the paper.
Why use an independent meta networks to generate the weights for each block, in stead of using the weights of the original networks and update them directly?