abstract:d1a89e01501d5adc.tex

1: \begin{abstract}

2: In this paper, we propose~$\GDA$, a distributed optimization method to solve saddle point problems of the form:~${\min_{\mb{x}} \max_{\mb{y}} \left\{ F(\mb x,\mb y) :=G(\mb x) + \langle \mb y, \ol{P} \mb x \rangle - H(\mb y) \right\}}$, where the functions~$G(\cdot)$,~$H(\cdot)$, and the the coupling matrix~$\ol{P}$ are distributed over a strongly connected network of nodes.~$\GDA$ is a first-order method that uses gradient tracking to eliminate the dissimilarity caused by heterogeneous data distribution among the nodes. In the most general form,~$\GDA$ includes a consensus over the local coupling matrices to achieve the optimal (unique) saddle point, however, at the expense of increased communication. To avoid this, we propose a more efficient variant~$\GDAl$ that does not incur the additional communication and analyze its convergence in various scenarios. We show that~$\GDA$ converges linearly to the unique saddle point solution when~$G$ is smooth and convex,~$H$ is smooth and strongly convex, and the global coupling matrix~$\ol{P}$ has full column rank. We further characterize the regime under which~$\GDA$ exhibits a network topology-independent convergence behavior. We next show the linear convergence of~$\GDAl$ to an error around the unique saddle point, which goes to zero when the coupling cost~${\langle \mb y, \ol{P} \mb x \rangle}$ is common to all nodes, or when~$G$ and~$H$ are quadratic. Numerical experiments illustrate the convergence properties and importance of~$\GDA$ and~$\GDAl$ for several applications.

3:

4: \begin{IEEEkeywords}

5: Decentralized optimization, saddle point problems, constrained optimization, descent ascent methods.

6: \end{IEEEkeywords}

7: \end{abstract}

8: