abstract:7b71a558e681adb9.tex

1: \begin{abstract}

2: A number of important problems in theoretical computer science and machine learning can be interpreted as recovering a certain basis.

3: These include  symmetric matrix eigendecomposition, certain tensor decompositions, Independent Component Analysis (ICA),  spectral clustering and Gaussian mixture learning.

4: Each of these problems reduces to an instance of our general model, which we call a ``Basis Encoding Function" (BEF). We show that learning a basis within this model

5: can then be provably and efficiently achieved using a first order  iteration algorithm (gradient iteration).

6: Our algorithm goes beyond tensor methods while generalizing a number of existing algorithms---e.g., the power method for symmetric matrices, the tensor power iteration for orthogonal decomposable tensors, and cumulant-based FastICA---all within a broader function-based dynamical systems framework.

7: Our framework also unifies the unusual phenomenon observed in these domains that they can be solved using efficient non-convex optimization.

8: Specifically, we describe a class of BEFs such that their local maxima on the unit sphere are in one-to-one correspondence with the basis elements. This description relies on a certain ``hidden convexity" property of these functions.

9:

10: We provide a complete theoretical analysis of the gradient iteration even when the BEF is perturbed.

11: We show convergence and complexity bounds polynomial in dimension and other relevant parameters, such as perturbation size.

12: Our perturbation results can be considered as a  non-linear version of the classical Davis-Kahan theorem for perturbations of eigenvectors of symmetric matrices.

13: In addition we show that   our algorithm exhibits fast (superlinear) convergence and relate the speed of convergence to the properties of the BEF.

14: Moreover, the gradient iteration algorithm can be easily and efficiently implemented in practice.

15: \end{abstract}

16: