1: \begin{abstract}
2: It is well known that both gradient descent and
3: stochastic coordinate descent achieve a global convergence rate of
4: $O(1/k)$ in the objective value, when applied to a scheme for
5: minimizing a Lipschitz-continuously differentiable, unconstrained
6: convex function. In this work, we improve this rate to $o(1/k)$.
7: We extend the result to proximal gradient and proximal coordinate
8: descent on regularized problems to show similar $o(1/k)$ convergence
9: rates. The result is tight in the sense that a rate of
10: $O(1/k^{1+\epsilon})$ is not generally attainable for any
11: $\epsilon>0$, for any of these methods.
12:
13: \keywords{ Gradient descent methods \and Coordinate descent methods \and Proximal
14: gradient methods \and Convex optimization \and
15: Complexity }
16: \end{abstract}
17: