\begin{abstract}
We consider the problem of principal component analysis (PCA) in a
streaming stochastic setting, where our goal is to find a direction of
approximate maximal variance, based on a stream of i.i.d. data points in
$\reals^d$. A simple and computationally cheap algorithm for this is
stochastic gradient descent (SGD), which incrementally updates its estimate
based on each new data point. However, due to the non-convex nature of the
problem, analyzing its performance has been a challenge. In particular,
existing guarantees rely on a non-trivial eigengap assumption on the
covariance matrix, which is intuitively unnecessary. In this paper, we
provide (to the best of our knowledge) the first eigengap-free convergence
guarantees for SGD in the context of PCA. This also partially resolves an
open problem posed in \cite{hardt2014noisy}. Moreover, under an eigengap
assumption, we show that the same techniques lead to new SGD convergence
guarantees with better dependence on the eigengap.
\end{abstract}
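
For concreteness, the SGD iteration referred to in the abstract can be
sketched as follows. This is only an illustration under assumed notation
that does not appear in the abstract: $x_t \in \reals^d$ denotes the $t$-th
data point, $w_t$ the current unit-norm estimate, $\eta > 0$ a step size,
and $A$ the covariance matrix of the data (taken here to be zero-mean, so
that $A$ is the expectation of $x x^\top$). The exact variant analyzed in
the body of the paper may differ in its details.
\[
  \max_{w \,:\, \|w\| = 1} \; w^\top A w ,
  \qquad
  w_{t+1} \;=\;
  \frac{w_t + \eta \, x_t x_t^\top w_t}
       {\| w_t + \eta \, x_t x_t^\top w_t \|} .
\]
Under these assumptions, each iteration takes a stochastic gradient step on
the objective $w^\top A w$ (with the constant factor of the gradient
absorbed into $\eta$) and then projects back onto the unit sphere. Since
$x_t x_t^\top w_t = x_t \, (x_t^\top w_t)$, one update costs $O(d)$ time
and memory, which is what makes the method attractive in the streaming
setting.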