evaluating-adversarial-robustness / adv-eval-paper

LaTeX source for the paper "On Evaluating Adversarial Robustness"
https://arxiv.org/abs/1902.06705
249 stars 33 forks source link

Decomposing improvements in accuracy #24

Open yaoshiang opened 4 years ago

yaoshiang commented 4 years ago

For some unpublished work, I have decomposed improvements into this waterfall. Please let me know if this would make sense to include.

Suppose:

Decomposition:

Random Guess: A trivial model that randomly guesses on k=[10] classes would achieve 1/k = [10%] accuracy. The first 9% of accuracy is attributable to a TRIVIAL RANDOM-GUESSING MODEL. This is not a very impressive part of the improvement in accuracy.

Different training seed: M_seed achieves 15% on x_test_adv. The next 5% of improved accuracy is attributable to NON-TRANSFERABILITY. This is also unimpressive.

Defense: M_defended achieves 30% on x_test_adv. Only the final 15% of improvement can be attributed to the defense.

Screen Shot 2020-01-08 at 10 52 27 AM

carlini commented 3 years ago

This is an excellent idea. I've seen a few papers do this in the past year or two and we should definitely include something like this.