abstract:1d3a06b3c89a057e.tex

1: \begin{abstract}

2:         Most existing results about \emph{last-iterate convergence} of learning dynamics are limited to two-player zero-sum games, and only apply under rigid assumptions about what dynamics the players follow.

3:     %

4:     In this paper we provide new results and techniques that apply to broader families of games and learning dynamics.

5:     %

6:     %OLD First, we establish that the length of the trajectories for dynamics such as \emph{optimistic mirror descent (OMD)} is bounded in a game class that includes polymatrix and strategically zero-sum games.

7:     First, we use a regret-based analysis to show that in a class of games that includes constant-sum polymatrix and strategically zero-sum games, dynamics such as \emph{optimistic mirror descent (OMD)} have \emph{bounded second-order path lengths}, a property which holds even when players employ different algorithms and prediction mechanisms. This enables us to obtain $O(1/\sqrt{T})$ rates and optimal $O(1)$ regret bounds.

8:     %

9:     Our analysis also reveals a surprising property: % for the rich class of \emph{smooth games} (Roughgarden JACM\,`15):

10:     OMD either reaches arbitrarily close to a Nash equilibrium, or it outperforms the \emph{robust price of anarchy} in efficiency.

11:     %

12:     Moreover, for potential games we establish convergence to an $\epsilon$-equilibrium after $O(1/\epsilon^2)$ iterations for mirror descent under a broad class of regularizers, as well as optimal $O(1)$ regret bounds for OMD variants. Our framework also extends to near-potential games, and unifies known analyses for distributed learning in Fisher's market model. Finally, we analyze the convergence, efficiency, and robustness of \emph{optimistic gradient descent (OGD)} in general-sum continuous games.

13: \end{abstract}

14: