1: \begin{abstract}
2: The graduated optimization approach, also known as the continuation method, is a popular heuristic to solving non-convex problems that has received renewed interest over the last decade. %with the advent of non-convex optimization for learning deep networks.
3: Despite its popularity, very little is known in terms of theoretical convergence analysis.
4:
5: In this paper we describe a new first-order algorithm based on graduated optimization and analyze its performance. We characterize a parameterized family of non-convex functions for which this algorithm provably converges to a {\bf global} optimum. In particular, we prove that the algorithm converges to an $\eps$-approximate solution within $O(1 / \eps^2)$ gradient-based steps.
6: We extend our algorithm and analysis to the setting of stochastic non-convex optimization with noisy gradient feedback, attaining the same convergence rate.
7: Additionally, we discuss the setting of ``zero-order optimization", and devise a a variant of our algorithm which converges at rate of $O(d^2/ \eps^4)$.
8:
9: \end{abstract}