1: \begin{abstract}% <- trailing '%' for backward compatibility of .sty file
2: We consider reinforcement learning with performance evaluated by a dynamic risk measure.
3: We construct a projected risk-averse dynamic programming equation and study its properties. Then
4: we propose risk-averse counterparts of the methods of temporal differences and we prove their convergence with probability one.
5: We also perform an empirical study on a complex transportation problem.\\
6: \noindent
7: \emph{Keywords:}{ Reinforcement Learning, Risk, Stochastic Approximation}\\
8: \emph{AMS:}
9: 49L20, 62L20, 90C39
10: \end{abstract}
11: