\begin{abstract}
  We study the generalization performance of online learning
  algorithms trained on samples coming from a dependent source of data.  We
  show that the generalization error of any stable online algorithm
  concentrates around its regret---an easily computable statistic of the
  online performance of the algorithm---when the underlying ergodic process is
  $\beta$- or $\phi$-mixing. We show high-probability error bounds assuming
  the loss function is convex, and we also establish sharp convergence rates
  and deviation bounds for strongly convex losses and several linear
  prediction problems such as linear and logistic regression, least-squares
  SVM, and boosting on dependent data.  In addition, our results have
  straightforward applications to stochastic optimization with dependent data,
  and our analysis requires only martingale convergence arguments; we need not
  rely on more powerful statistical tools such as empirical process theory.
\end{abstract}