c3b2f9aa1a09f33e.tex
1: \begin{abstract}
2:   Stochastic-approximation gradient methods are attractive for
3:   large-scale convex optimization because they offer inexpensive
4:   iterations. They are especially popular in data-fitting and
5:   machine-learning applications where the data arrives in a continuous
6:   stream, or it is necessary to minimize large sums of functions. It
7:   is known that by appropriately decreasing the variance of the error
8:   at each iteration, the expected rate of convergence matches that of
9:   the underlying deterministic gradient method. Conditions are given
10:   under which this happens with overwhelming probability.
11: \end{abstract}
12: