abstract:a8ccad4f548c3fe6.tex

1: \begin{abstract}

2:     %

3:

4: We introduce a general framework for competitive gradient-based learning

5: that encompasses a wide breadth of multi-agent learning algorithms, and analyze the limiting behavior of competitive gradient-based learning algorithms using dynamical systems theory. For both general-sum and potential games, we characterize  a

6: non-negligible subset of the local Nash equilibria that will be avoided if each agent employs a gradient-based learning algorithm. We also shed light on the issue of convergence to non-Nash strategies in general- and zero-sum games, which may have no relevance to the underlying game, and arise solely due

7: to the choice of algorithm. The existence and frequency of such strategies may

8: explain some of the difficulties encountered when using gradient descent in

9: zero-sum games as, e.g., in the training of generative adversarial networks. To reinforce the theoretical

10: contributions, we provide empirical results that highlight the frequency of linear quadratic dynamic games (a benchmark for multi-agent reinforcement learning) that admit global Nash equilibria that are almost surely avoided by policy gradient. %

11:

12:  \end{abstract}

13: