1: \begin{abstract}
2: For high-dimensional classification, it is well known that
3: naively performing the Fisher discriminant rule leads to poor results
4: due to diverging spectra and noise accumulation.
5: Therefore, researchers proposed independence rules to circumvent
6: the diverging spectra, and sparse independence rules to mitigate the issue of noise
7: accumulation. However, in biological applications, there are often a group of correlated genes
8: responsible for clinical outcomes, and the use of the covariance information
9: can significantly reduce misclassification rates. In theory the extent of such error rate reductions is unveiled by comparing the misclassification rates of
10: the Fisher discriminant rule and the independence rule.
11: To materialize the gain based on finite samples,
12: a Regularized Optimal Affine Discriminant (ROAD) is proposed. ROAD
13: selects an increasing number of features as the regularization relaxes.
14: Further benefits can be achieved when a screening method
15: is employed to narrow the feature pool before hitting the ROAD.
16: An efficient Constrained Coordinate Descent algorithm (CCD)
17: is also developed to solve the associated optimization problems. Sampling properties of oracle type are established.
18: Simulation studies and real data analysis
19: support our theoretical results and demonstrate the advantages
20: of the new classification procedure under a variety of correlation structures. A delicate result on continuous piecewise linear solution path for the ROAD optimization problem at the population level justifies the linear interpolation of the CCD algorithm.
21: \end{abstract}