ac0740258c4abdae.tex
1: \begin{abstract}
2: % Blackwell approachability has been a key ingredient in practical algorithms for solving large-scale extensive-form games, through the use of Blackwell approachability algorithms for the \emph{simplex}, and the counterfactual regret minimization framework.
3: In this paper, we introduce the first algorithmic framework for Blackwell approachability on the sequence-form polytope, the class of convex polytopes capturing the strategies of players in extensive-form games (EFGs).
4: This leads to a new class of regret-minimization algorithms that are stepsize-invariant, in the same sense as the regret matching and regret matching$^+$ algorithms for the simplex.
5: % In this paper, we introduce a novel algorithmic framework for solving two-player zero-sum extensive-form games, based on the self-play framework and on Blackwell approachability directly applied to the treeplexes of each player. 
6: Our modular framework can be combined with any existing regret minimizer over cones to compute a Nash equilibrium in two-player zero-sum EFGs with perfect recall, through the self-play framework. Leveraging predictive online mirror descent, we introduce {\em Predictive Treeplex Blackwell$^+$} (\ptbp), and show a $O(1/\sqrt{T})$ convergence rate to Nash equilibrium in self-play. We then show how to stabilize \ptbp{} with a stepsize, resulting in an algorithm with a state-of-the-art $O(1/T)$ convergence rate. 
7: We provide an extensive set of experiments to compare our framework with several algorithmic benchmarks, including \cfrp{} and its predictive variant, and we highlight interesting connections between practical performance and the stepsize-dependence or stepsize-invariance properties of classical algorithms.
8: \end{abstract}
9: