\begin{abstract}

\noindent \textbf{Abstract}: Sparse principal component analysis
(PCA) is an important technique for dimensionality reduction of high-dimensional
data. However, most existing sparse PCA algorithms are based on non-convex
optimization, which provides little guarantee of global convergence.
Sparse PCA algorithms based on a convex formulation, for example the
Fantope projection and selection (FPS), overcome this difficulty,
but are computationally expensive. In this work we study sparse PCA
based on the convex FPS formulation, and propose a new algorithm that
is computationally efficient and applicable to large and high-dimensional
data sets. Nonasymptotic and explicit bounds are derived for both
the optimization error and the statistical accuracy, which can be
used for testing and inference problems. We also extend our algorithm
to online learning problems, where data are obtained in a streaming
fashion. The proposed algorithm is applied to high-dimensional gene
expression data for the detection of functional gene groups.\medskip{}


\noindent \textbf{Keywords}: sparse principal component analysis,
dimensionality reduction, convex optimization, gradient method, online
learning.

\end{abstract}