1: \begin{abstract}
2: We study the impact of the constraint set and gradient geometry on the
3: convergence of online and stochastic methods for convex optimization,
4: providing a characterization of the geometries for which stochastic gradient
5: and adaptive gradient methods are (minimax) optimal. In particular, we show
6: that when the constraint set is quadratically convex, diagonally
7: pre-conditioned stochastic gradient methods are minimax optimal. We further
8: provide a converse that shows that when the constraints are not quadratically
9: convex---for example, any $\ell_p$-ball for $p < 2$---the methods are far
10: from optimal. Based on this, we can provide concrete recommendations for when
11: one should use adaptive, mirror or stochastic gradient methods.
12: \end{abstract}
13: