abstract:2fc6ad6657152bac.tex

1: \begin{abstract}

2: In this paper we demonstrate a simple heuristic adaptive restart technique that can

3: dramatically improve the convergence rate of accelerated gradient schemes.  The

4: analysis of the technique relies on the observation that these schemes exhibit

5: two modes of behavior depending on how much momentum is applied. In what we

6: refer to as the `high momentum' regime the iterates generated by an accelerated

7: gradient scheme exhibit a periodic behavior, where the period is

8: proportional to the square root of the local condition number of the objective

9: function.  This suggests a restart technique whereby we reset the momentum

10: whenever

11: we observe periodic behavior. We provide analysis to show that in many cases

12: adaptively restarting allows us to recover the optimal rate of convergence

13: with no prior knowledge of function parameters.

14: \iffalse

15: We also demonstrate that functions that are well approximated by a quadratic

16: around their optima can enter a region of linear convergence, even if they

17: are not globally strongly convex.

18: It has been shown that periodically restarting an accelerated smooth first

19: order convex optimization algorithm can improve the convergence rate.  In this

20: paper we present two \emph{adaptive} restarting techniques for such

21: algorithms.  We refer to the techniques as adaptive because restarts are made

22: not at fixed intervals but when some condition on the iterates is satisfied.

23: We demonstrate that within a region near the optimum an adaptive restart

24: algorithm can achieve linear convergence, much better than that guaranteed by

25: the accelerated algorithms alone. In addition, the restart technique is able to

26: automatically exploit local function properties and therefore can

27: sometimes obtain a better convergence rate than that predicted by global

28: function parameters.

29: \fi

30: \end{abstract}

31: