\begin{abstract}

We study the problem of recovering the subspace spanned by the first $k$ principal components of $d$-dimensional data
in the streaming setting, with a memory bound of $\mathcal{O}(kd)$. Two families of algorithms are known for this
problem. The first family is based on the framework of stochastic gradient descent. However, its convergence rate is
sensitive to the learning rate of the descent steps and warrants closer study. The second
family is based on the power method over blocks of data, but setting the block size for its existing algorithms is not
an easy task. In this paper, we analyze the convergence rate of a representative algorithm with decayed learning
rate~\citep{oja} in the first family for the general $k>1$ case. Moreover, we propose a novel algorithm for the second
family that sets the block sizes automatically and dynamically, with a faster convergence rate. We then conduct empirical
studies that fairly compare the two families on real-world data. The studies reveal the advantages and disadvantages of
the two families.

\end{abstract}
