\begin{abstract}
The graduated optimization approach, also known as the continuation method, is a popular heuristic for solving non-convex problems that has received renewed interest over the last decade. %with the advent of non-convex optimization for learning deep networks.
Despite its popularity, very little is known about its theoretical convergence properties.

In this paper we describe a new first-order algorithm based on graduated optimization and analyze its performance. We characterize a parameterized family of non-convex functions for which this algorithm provably converges to a {\bf global} optimum. In particular, we prove that the algorithm converges to an $\eps$-approximate solution within $O(1/\eps^2)$ gradient-based steps.
We extend our algorithm and analysis to the setting of stochastic non-convex optimization with noisy gradient feedback, attaining the same convergence rate.
Additionally, we discuss the setting of ``zero-order optimization'', and devise a variant of our algorithm which converges at a rate of $O(d^2/\eps^4)$.
\end{abstract}