\begin{abstract}
Regret Matching$^+$ (RM$^+$) and its variants are important algorithms for solving large-scale games \citep{tammelin2014solving}.
However, their success in practice is still not well understood theoretically.
Moreover, recent advances \citep{syrgkanis2015fast} on fast convergence in games are limited to no-regret algorithms such as online mirror descent, which satisfy \emph{stability}.
In this paper, we first give counterexamples showing that RM$^+$
and its predictive version~\citep{farina2021faster} can be unstable, which might cause other players to suffer large regret.
We then provide two fixes: restarting and chopping off the positive orthant that \rmp\ works in.
We show that these fixes are sufficient to get $O(T^{1/4})$ individual regret and $O(1)$ social regret in normal-form games via RM$^+$ with predictions.
We also apply our stabilizing techniques to clairvoyant updates in the uncoupled learning setting for RM$^+$ and prove desirable results akin to recent works for Clairvoyant online mirror descent~\citep{piliouras2021optimal,farina2022clairvoyant}.
Our experiments show the advantages of our algorithms over vanilla RM$^+$-based algorithms in matrix and extensive-form games.
\end{abstract}