abstract:174486dd7ce86260.tex

1: \begin{abstract}

2: 	In this paper, we prove some convergence results of a special case of optimistic policy iteration algorithm for stochastic shortest path problem mentioned in \cite{Ts03} . We consider both Monte Carlo  and $TD(\lambda)$ methods for the policy evaluation step under the condition that termination state will eventually be reached almost surely.

3: \end{abstract}

4: