This repository contains the code for Skip Connections Matter: On the Transferability of Adversarial Examples Generated with ResNets (ICLR 2020 Spotlight).
We propose the Skip Gradient Method (SGM) to generate adversarial examples using gradients more from the skip connections rather than the residual modules. In particular, SGM utilizes a decay factor (gamma) to reduce gradients from the residual modules,
This code is implemented in PyTorch, and we have tested the code under the following environment settings:
./SubImageNet224/
./adv_images/
. For ResNet-152 as the source model,
python attack_sgm.py --gamma 0.2 --output_dir adv_images --arch densenet201 --batch-size 40
For DenseNet-201 as the source model,
python attack_sgm.py --gamma 0.5 --output_dir adv_images --arch resnet152 --batch-size 40
./adv_images/
. For VGG19 with batch norm as the target model
python evaluate.py --input_dir adv_images --arch vgg19_bn
Visualization
Reproduced results
We run this code, and the attack success (1 - acc) against VGG19 is close to the repored in our paper:
Source \ Method | PGD | MI | SGM |
---|---|---|---|
ResNet-152 | 45.80% | 66.70% | 81.04% |
DenseNet-201 | 57.82% | 75.38% | 82.58% |
For easier reproduction, we provide more detailed information here.
In fact, we manipulate gradients flowing through ReLU in utils_sgm, since there is no ReLU in skip-connections:
For ResNet, there are "downsampling" modules in which skip-connections are replaced by a conv layer. We do not manipulate gradients of "downsampling" module;
For DenseNet, we manipulate gradients in all dense block.
All pretrained models in our paper can be found online:
For VGG/ResNet/DenseNet/SENet, we use pretrained models in pretrainedmodels;
For Inception models, we use pretrained models in slim of Tensorflow;
For secured models (e.g. ), they are trained by Ensemble Adversarial Training [2], and pretrained results can be found in adv_imagenet_models.
@inproceedings{wu2020skip,
title={Skip connections matter: On the transferability of adversarial examples generated with resnets},
author={Wu, Dongxian and Wang, Yisen and Xia, Shu-Tao and Bailey, James and Ma, Xingjun},
booktitle={ICLR},
year={2020}
}
[1] Yinpeng Dong, Fangzhou Liao, Tianyu Pang, Hang Su, Jun Zhu, Xiaolin Hu, and Jianguo Li. Boosting adversarial attacks with momentum. In CVPR, 2018.
[2] Florian Tramèr, Alexey Kurakin, Nicolas Papernot, Ian Goodfellow, Dan Boneh, Patrick McDaniel. Ensemble Adversarial Training: Attacks and Defenses. In ICLR, 2018.