1: \begin{abstract}
2: Regret-based algorithms are highly efficient at finding approximate Nash equilibria in sequential games such as poker games.
3: However, most regret-based algorithms, including counterfactual regret minimization (CFR) and its variants, rely on iterate averaging to achieve convergence.
4: Inspired by recent advances on \emph{last-iterate} convergence of optimistic algorithms in zero-sum normal-form games, we study this phenomenon in sequential games, and provide a comprehensive study of last-iterate convergence for zero-sum extensive-form games with perfect recall (EFGs), using various optimistic regret-minimization algorithms over treeplexes.
5: This includes algorithms using the vanilla entropy or squared Euclidean norm regularizers, as well as their dilated versions which admit more efficient implementation.
6: In contrast to CFR, we show that all of these algorithms enjoy last-iterate convergence, with some of them even converging \emph{exponentially} fast.
7: We also provide experiments to further support our theoretical results.
8: \end{abstract}