1: \begin{abstract}
2: This paper studies $\ell_1$ regularization with high-dimensional
3: features for support vector machines with a~built-in reject option
4: (meaning that the decision of classifying an observation can be
5: withheld at a cost lower than that of misclassification). The procedure
6: can be conveniently implemented as a linear program and computed using
7: standard software. We prove that the minimizer of the penalized
8: population risk favors sparse solutions and show that the behavior of
9: the empirical risk minimizer mimics that of the population risk
10: minimizer. We also introduce a notion of classification complexity and
11: prove that our minimizers adapt to the unknown complexity. Using a
12: novel oracle inequality for the excess risk, we identify situations
13: where fast rates of convergence occur.
14: \end{abstract}