2a99ca3da7f72830.tex
1: \begin{abstract}%   <- trailing '%' for backward compatibility of .sty file
2:   This paper establishes the first almost sure convergence rate and the first maximal concentration bound with exponential tails for general contractive stochastic approximation algorithms with Markovian noise.
3:   As a corollary,
4:   we also obtain convergence rates in $L^p$.
5:   Key to our successes is a novel discretization of the mean ODE of stochastic approximation algorithms using intervals with diminishing (instead of constant) length.
6:   As applications,
7:   we provide the first almost sure convergence rate for $Q$-learning with Markovian samples without count-based learning rates.
8:   We also provide the first concentration bound for off-policy temporal difference learning with Markovian samples.
9:   \let\svthefootnote\thefootnote
10:   \let\thefootnote\relax\footnotetext{$^*$ indicates equal contribution.}
11:   \let\thefootnote\svthefootnote
12: \end{abstract}
13: