erfanhatefi / Pruning-by-eXplaining-in-PyTorch

Pruning By Explaining Revisited: Optimizing Attribution Methods to Prune CNNs and Transformers, Paper accepted at eXCV workshop of ECCV 2024
https://arxiv.org/abs/2408.12568
8 stars 0 forks source link

Pruning By Explaining Revisited: Optimizing Attribution Methods to Prune CNNs and Transformers

arXiv

Accepted at eXCV Workshop of ECCV 2024


Abstract

To solve ever more complex problems, Deep Neural Networks are scaled to billions of parameters, leading to huge computational costs. An effective approach to reduce computational requirements and increase efficiency is to prune unnecessary components of these often over-parameterized networks. Previous work has shown that attribution methods from the field of eXplainable AI serve as effective means to extract and prune the least relevant network components in a few-shot fashion. We extend the current state by proposing to explicitly optimize hyperparameters of attribution methods for the task of pruning, and further include transformer-based networks in our analysis. Our approach yields higher model compression rates of large transformer- and convolutional architectures (VGG, ResNet, ViT) compared to previous works, while still attaining high performance on ImageNet classification tasks. Here, our experiments indicate that transformers have a higher degree of over-parameterization compared to convolutional neural networks.

Attribution-based pruning workflow


Code is going to be published actively!


Citation

Feel free to cite us, in case you have used our work:

@misc{hatefi2024pruningexplainingrevisitedoptimizing,
      title={Pruning By Explaining Revisited: Optimizing Attribution Methods to Prune CNNs and Transformers}, 
      author={Sayed Mohammad Vakilzadeh Hatefi and Maximilian Dreyer and Reduan Achtibat and Thomas Wiegand and Wojciech Samek and Sebastian Lapuschkin},
      year={2024},
      eprint={2408.12568},
      archivePrefix={arXiv},
      primaryClass={cs.AI},
      url={https://arxiv.org/abs/2408.12568}
}