\begin{abstract}
This note is a companion to our recent theoretical study of the weak convergence properties of constrained emphatic temporal-difference (ETD) learning algorithms.
It supplements that analysis with simulation results and illustrates the behavior of some of the ETD algorithms on three example problems.
\end{abstract}
