Model overfits with low test accuracy for higher epsilon values

locuslab / fast_adversarial

[ICLR 2020] A repository for extremely fast adversarial training using FGSM

434 stars 92 forks source link

I'm using the FGSM approach to train a ResNet18 model on CIFAR10.

Using the values in the paper for epsilon=8/255 and alpha=10/255 works fine. But when I try to extend to an epsilon of 12 (and an alpha of 1.25*epsilon as outlined in the paper, so 15) to compare to other robust models, the model catastrophically overfits relatively early with very low clean example accuracy (50 to 60%). Has anyone had success using this approach with a higher epsilon than 8/255? Does alpha=1.25*epsilon not apply for other values of epsilon?

Thanks in advance for any help you can provide.

locuslab / fast_adversarial

Model overfits with low test accuracy for higher epsilon values #4