abstract:ef7366ef50b12847.tex

1: \begin{abstract}

2:   It is well known that both gradient descent and

3:   stochastic coordinate descent achieve a global convergence rate of

4:   $O(1/k)$ in the objective value, when applied to a scheme for

5:   minimizing a Lipschitz-continuously differentiable, unconstrained

6:   convex function.  In this work, we improve this rate to $o(1/k)$.

7:   We extend the result to proximal gradient and proximal coordinate

8:   descent on regularized problems to show similar $o(1/k)$ convergence

9:   rates. The result is tight in the sense that a rate of

10:   $O(1/k^{1+\epsilon})$ is not generally attainable for any

11:   $\epsilon>0$, for any of these methods.

12:

13: \keywords{ Gradient descent methods \and Coordinate descent methods \and Proximal

14: gradient methods \and Convex optimization \and

15: Complexity  }

16: \end{abstract}

17: