\begin{abstract}
  We study the impact of the constraint set and gradient geometry on the
  convergence of online and stochastic methods for convex optimization,
  providing a characterization of the geometries for which stochastic gradient
  and adaptive gradient methods are (minimax) optimal. In particular, we show
  that when the constraint set is quadratically convex, diagonally
  pre-conditioned stochastic gradient methods are minimax optimal. We further
  provide a converse that shows that when the constraints are not quadratically
  convex---for example, any $\ell_p$-ball for $p < 2$---the methods are far
  from optimal. Based on this, we can provide concrete recommendations for when
  one should use adaptive, mirror or stochastic gradient methods.
\end{abstract}