1: \begin{abstract}
2: We consider the problem of principal component analysis (PCA) in a
3: streaming stochastic setting, where our goal is to find a direction of
4: approximate maximal variance, based on a stream of i.i.d. data points in
5: $\reals^d$. A simple and computationally cheap algorithm for this is
6: stochastic gradient descent (SGD), which incrementally updates its estimate
7: based on each new data point. However, due to the non-convex nature of the
8: problem, analyzing its performance has been a challenge. In particular,
9: existing guarantees rely on a non-trivial eigengap assumption on the
10: covariance matrix, which is intuitively unnecessary. In this paper, we
11: provide (to the best of our knowledge) the first eigengap-free convergence
12: guarantees for SGD in the context of PCA. This also partially resolves an
13: open problem posed in \cite{hardt2014noisy}. Moreover, under an eigengap assumption, we show that the same techniques lead to new SGD convergence guarantees with better dependence on the eigengap.
14: \end{abstract}