abstract:a323be25bbc47d5e.tex

1: \begin{abstract}

2:   We study convergence properties of Stochastic Gradient Descent (SGD)

3:   for convex objectives without assumptions on smoothness or strict

4:   convexity. We consider the question of establishing that with high

5:   probability the objective evaluated at the

6:   candidate minimizer returned by SGD is close to the minimal value of

7:   the objective. We compare this result concerning the final candidate

8:   minimzer (i.e.~the final model parameters learned after all gradient

9:   steps) to the online learning techniques of

10:   \cite{zinkevich-online-sgd} that take a rolling average of the model

11:   parameters at the different steps of SGD.

12: \end{abstract}

13: