\begin{abstract}
Stochastic-approximation gradient methods are attractive for
large-scale convex optimization because they offer inexpensive
iterations. They are especially popular in data-fitting and
machine-learning applications where the data arrives in a continuous
stream, or where it is necessary to minimize large sums of functions.
It is known that if the variance of the error is decreased
appropriately at each iteration, the expected rate of convergence
matches that of the underlying deterministic gradient method.
Conditions are given under which this happens with overwhelming
probability.
\end{abstract}
