abstract:4566fa7ad15f1389.tex

1: \begin{abstract}

2: Stability issues with reinforcement learning methods persist.

3: To better understand some of these stability and convergence issues involving deep reinforcement learning methods, we examine a simple linear quadratic example.

4: We interpret the convergence criterion of exact Q-learning in the sense of a monotone scheme and discuss consequences of function approximation on monotonicity properties.

5: \end{abstract}

6: