abstract:97a6f9e3616eea69.tex

1: \begin{abstract}

2: This work studies an algorithm, which we call magnetic mirror descent, that is inspired by mirror descent and the non-Euclidean proximal gradient algorithm.

3: Our contribution is demonstrating the virtues of magnetic mirror descent as both an equilibrium solver and as an approach to reinforcement learning in two-player zero-sum games.

4: These virtues include:

5: 1)~Being the first quantal response equilibria solver to achieve linear convergence for extensive-form games with first order feedback;

6: 2)~Being the first standard reinforcement learning algorithm to achieve empirically competitive results with CFR in tabular settings;

7: 3)~Achieving favorable performance in 3x3 Dark Hex and Phantom Tic-Tac-Toe as a self-play deep reinforcement learning algorithm.

8: \end{abstract}

9: