7b71a558e681adb9.tex
1: \begin{abstract} 
2: A number of important problems in theoretical computer science and machine learning can be interpreted as recovering a certain basis. 
3: These include  symmetric matrix eigendecomposition, certain tensor decompositions, Independent Component Analysis (ICA),  spectral clustering and Gaussian mixture learning.
4: Each of these problems reduces to an instance of our general model, which we call a ``Basis Encoding Function" (BEF). We show that learning a basis within this model 
5: can then be provably and efficiently achieved using a first order  iteration algorithm (gradient iteration).
6: Our algorithm goes beyond tensor methods while generalizing a number of existing algorithms---e.g., the power method for symmetric matrices, the tensor power iteration for orthogonal decomposable tensors, and cumulant-based FastICA---all within a broader function-based dynamical systems framework.
7: Our framework also unifies the unusual phenomenon observed in these domains that they can be solved using efficient non-convex optimization. 
8: Specifically, we describe a class of BEFs such that their local maxima on the unit sphere are in one-to-one correspondence with the basis elements. This description relies on a certain ``hidden convexity" property of these functions.
9: 
10: We provide a complete theoretical analysis of the gradient iteration even when the BEF is perturbed. 
11: We show convergence and complexity bounds polynomial in dimension and other relevant parameters, such as perturbation size. 
12: Our perturbation results can be considered as a  non-linear version of the classical Davis-Kahan theorem for perturbations of eigenvectors of symmetric matrices.
13: In addition we show that   our algorithm exhibits fast (superlinear) convergence and relate the speed of convergence to the properties of the BEF.  
14: Moreover, the gradient iteration algorithm can be easily and efficiently implemented in practice.
15: \end{abstract}
16: