\begin{abstract}
  It is well known that both gradient descent and
  stochastic coordinate descent achieve a global convergence rate of
  $O(1/k)$ in the objective value when applied to the unconstrained
  minimization of a Lipschitz-continuously differentiable convex
  function.  In this work, we improve this rate to $o(1/k)$.
  We extend the result to proximal gradient and proximal coordinate
  descent on regularized problems, showing similar $o(1/k)$ convergence
  rates. The result is tight in the sense that a rate of
  $O(1/k^{1+\epsilon})$ is not generally attainable for any
  $\epsilon>0$, for any of these methods.

\keywords{Gradient descent methods \and Coordinate descent methods \and Proximal
gradient methods \and Convex optimization \and
Complexity}
\end{abstract}