Prune a model while finetuning or training.
394
stars
58
forks
source link
Layer2NoNorm uses same mix and delta value during eval mode as during training (useful for debugging or analysis of Layer2NoNorm transition) #13
Closed
echarlaix closed 3 years ago
Perfect !