1: \begin{abstract}
2: A number of important problems in theoretical computer science and machine learning can be interpreted as recovering a certain basis.
3: These include symmetric matrix eigendecomposition, certain tensor decompositions, Independent Component Analysis (ICA), spectral clustering and Gaussian mixture learning.
4: Each of these problems reduces to an instance of our general model, which we call a ``Basis Encoding Function" (BEF). We show that learning a basis within this model
5: can then be provably and efficiently achieved using a first order iteration algorithm (gradient iteration).
6: Our algorithm goes beyond tensor methods while generalizing a number of existing algorithms---e.g., the power method for symmetric matrices, the tensor power iteration for orthogonal decomposable tensors, and cumulant-based FastICA---all within a broader function-based dynamical systems framework.
7: Our framework also unifies the unusual phenomenon observed in these domains that they can be solved using efficient non-convex optimization.
8: Specifically, we describe a class of BEFs such that their local maxima on the unit sphere are in one-to-one correspondence with the basis elements. This description relies on a certain ``hidden convexity" property of these functions.
9:
10: We provide a complete theoretical analysis of the gradient iteration even when the BEF is perturbed.
11: We show convergence and complexity bounds polynomial in dimension and other relevant parameters, such as perturbation size.
12: Our perturbation results can be considered as a non-linear version of the classical Davis-Kahan theorem for perturbations of eigenvectors of symmetric matrices.
13: In addition we show that our algorithm exhibits fast (superlinear) convergence and relate the speed of convergence to the properties of the BEF.
14: Moreover, the gradient iteration algorithm can be easily and efficiently implemented in practice.
15: \end{abstract}
16: