174486dd7ce86260.tex
1: \begin{abstract}  
2: 	In this paper, we prove some convergence results of a special case of optimistic policy iteration algorithm for stochastic shortest path problem mentioned in \cite{Ts03} . We consider both Monte Carlo  and $TD(\lambda)$ methods for the policy evaluation step under the condition that termination state will eventually be reached almost surely. 
3: \end{abstract}
4: