
\begin{abstract}%   <- trailing '%' for backward compatibility of .sty file
We consider reinforcement learning with performance evaluated by a dynamic risk measure.
We construct a projected risk-averse dynamic programming equation and study its properties. Then
we propose risk-averse counterparts of the methods of temporal differences and we prove their convergence with probability one.
We also perform an empirical study on a complex transportation problem.\\
\noindent
\emph{Keywords:}{ Reinforcement Learning, Risk, Stochastic Approximation}\\
\emph{AMS:}
  49L20, 62L20, 90C39
\end{abstract}
