1: \begin{abstract}%
2: We present \textit{Matrix Krasulina}, an algorithm for online $k$-PCA, by generalizing the classic Krasulina's method \citep{krasulina1969} from vector to matrix case.
3: We show, both theoretically and empirically, that the algorithm naturally adapts to data low-rankness and converges exponentially fast to the ground-truth principal subspace.
4: Notably, our result suggests that despite various recent efforts to accelerate the convergence of stochastic-gradient based methods by adding a $O(n)$-time variance reduction step, for the $k$-PCA problem, a truly online SGD variant suffices to achieve exponential convergence on intrinsically low-rank data.
5: \end{abstract}
6: