abstract:ddc1ffc50bed9ce8.tex

1: \begin{abstract}

2: Machine learning classifiers with high test accuracy often perform

3: poorly under adversarial attacks.

4: It is commonly believed that

5: %, i.e. they have high \emph{robust} error.

6: adversarial training %is commonly believed to

7: alleviates this issue.

8: %effectively decrease the robust error.

9: In this paper, we demonstrate that,

10: surprisingly, the opposite may be true --- Even though adversarial training helps when enough data is  available, it may hurt robust generalization in the small sample size regime.

11: %We show that adversarial training

12: %with perceptible attacks can hurt robust generalization on

13: We first prove this phenomenon for a high-dimensional linear

14: classification setting with noiseless observations. Our proof provides explanatory insights that may also transfer to feature learning models.

15: %Specifically, when SGD on the robust logistic loss is run until convergence,

16: %Specifically we show that the robust error of the robust max-margin solution monotonically increases with increasing training perturbation

17: %strength set size $\epsilon$, starting from standard training ($\epsilon =

18: %0$).

19: %In particular, this drop is more pronounced for small sample sizes.

20: Further, we observe in experiments on standard image datasets that the same behavior occurs %in the small sample size regime

21: for perceptible attacks

22: that effectively reduce class information such as mask attacks and object corruptions.

23: %This paper provides an example how common beliefs may need to be revisited

24: \end{abstract}

25: