1: \begin{abstract}
2: We study the generalization performance of online learning
3: algorithms trained on samples coming from a dependent source of data. We
4: show that the generalization error of any stable online algorithm
5: concentrates around its regret---an easily computable statistic of the
6: online performance of the algorithm---when the underlying ergodic process is
7: $\beta$- or $\phi$-mixing. We show high probability error bounds assuming
8: the loss function is convex, and we also establish sharp convergence rates
9: and deviation bounds for strongly convex losses and several linear
10: prediction problems such as linear and logistic regression, least-squares
11: SVM, and boosting on dependent data. In addition, our results have
12: straightforward applications to stochastic optimization with dependent data,
13: and our analysis requires only martingale convergence arguments; we need not
14: rely on more powerful statistical tools such as empirical process theory.
15: \end{abstract}