1: \begin{abstract}
2: In this paper, we prove some convergence results of a special case of optimistic policy iteration algorithm for stochastic shortest path problem mentioned in \cite{Ts03} . We consider both Monte Carlo and $TD(\lambda)$ methods for the policy evaluation step under the condition that termination state will eventually be reached almost surely.
3: \end{abstract}
4: