abstract:98ff2cc7536e2ac0.tex

1: \begin{abstract}

2:

3: This paper establishes that an MDP with a unique optimal policy and ergodic associated transition matrix ensures the convergence of various versions of the Value Iteration algorithm at a geometric rate that exceeds the discount factor $\gamma$ for both  discounted and average-reward criteria.

4:

5: \end{abstract}

6: