1: \begin{abstract}
2: We consider the high-dimensional discriminant analysis problem. For
3: this problem, different methods have been proposed and justified by
4: establishing exact convergence rates for the classification risk, as
5: well as the $\ell_2$ convergence results to the discriminative
6: rule. However, sharp theoretical analysis for the variable selection
7: performance of these procedures have not been established, even
8: though model interpretation is of fundamental importance in
9: scientific data analysis. This paper bridges the gap by providing
10: sharp sufficient conditions for consistent variable selection using
11: the sparse discriminant analysis \citep{mai2012}. Through careful
12: analysis, we establish rates of convergence that are significantly
13: faster than the best known results and admit an optimal scaling of
14: the sample size $n$, dimensionality $p$, and sparsity level $s$ in
15: the high-dimensional setting. Sufficient conditions are
16: complemented by the necessary information theoretic limits on the
17: variable selection problem in the context of high-dimensional
18: discriminant analysis. Exploiting a numerical equivalence result,
19: our method also establish the optimal results for the ROAD estimator
20: \citep{fan2010road} and the sparse optimal scaling estimator
21: \citep{clemmensen2011sparse}. Furthermore, we analyze an exhaustive
22: search procedure, whose performance serves as a benchmark, and show
23: that it is variable selection consistent under weaker conditions.
24: Extensive simulations demonstrating the sharpness of the bounds are
25: also provided.
26: \end{abstract}
27: