1: \begin{abstract}
2: We generalize stochastic subgradient descent methods to situations in which
3: we do not receive independent samples from the distribution over which we
4: optimize, instead receiving samples coupled over time. We show
5: that as long as the source of randomness is suitably ergodic---it converges
6: quickly enough to a stationary distribution---the method enjoys strong
7: convergence guarantees, both in expectation and with high probability. This
8: result has implications for stochastic optimization in high-dimensional
9: spaces, peer-to-peer distributed optimization schemes, decision problems
10: with dependent data, and stochastic optimization problems over combinatorial
11: spaces.
12: \end{abstract}
13: