Closed vipinpillai closed 6 years ago
Oh, you are right. Just fixed it. This doesn't really affect our results since clipping just makes the attack weaker. Thanks for noticing!
@dtsip Actually, it does make it stronger albeit very marginally. Because the image is not in the valid pixel range, the gradients for the first iteration might not be very meaningful. I did verify the attack results both with and without clipping for the adversarially trained model on CIFAR10. The white-box attack accuracy dropped very slightly from 48.16% to 48.09%. I didn't actually verify the numbers across multiple-restarts though.
Hi,
I noticed that the image which is fed to the model to obtain the gradients for the first iteration of the PGD attack is not clipped to be in the valid image range.
Here, random noise is added to the original image and the resulting image is directly fed to the network for the first iteration without clipping.