Closed osivaz61 closed 1 year ago
Random for most standard networks, for ResNet models with the same Kaiming initialization that ResNet follows. We did not explicitly optimize to find optimal initialization for the weights.
Random for most standard networks, for ResNet models with the same Kaiming initialization that ResNet follows. We did not explicitly optimize to find optimal initialization for the weights.