Open 5g4s opened 1 year ago
We show that large models are more robust to compression techniques such as quantization and pruning than small models. Heavily compressed, large models achieve higher accuracy than lightly compressed, small models.
As models become increasingly large, they contain small subnetworks which achieve high accuracy.
https://arxiv.org/abs/2002.11794