\begin{abstract}%
  Motivated by the training of Generative Adversarial Networks (GANs), we study methods for solving minimax problems with additional nonsmooth regularizers.
  We do so by employing \emph{monotone operator} theory, in particular the \emph{Forward-Backward-Forward (FBF)} method, which avoids the known issue of limit cycling by correcting each update by a second gradient evaluation.
  Furthermore, we propose a seemingly new scheme which recycles old gradients to mitigate the additional computational cost.
  In doing so we rediscover a known method, related to \emph{Optimistic Gradient Descent Ascent (OGDA)}.
  For both schemes we prove novel convergence rates for convex-concave minimax problems via a unifying approach. The derived error bounds are in terms of the gap function for the ergodic iterates.
  For the deterministic and the stochastic problem we show a convergence rate of $\mathcal{O}(\nicefrac{1}{k})$ and $\mathcal{O}(\nicefrac{1}{\sqrt{k}})$, respectively.
  We complement our theoretical results with empirical improvements in the training of Wasserstein GANs on the CIFAR10 dataset.
\end{abstract}
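
% Illustrative sketch only; the notation below is assumed and does not appear in the abstract.
For orientation, the two update rules mentioned in the abstract can be sketched as follows. The notation is an assumption made here for illustration: $w = (x, y)$ collects the two players' variables, $F(w) = (\nabla_x \Psi(x,y), -\nabla_y \Psi(x,y))$ is the monotone operator of a convex-concave coupling function $\Psi$, $g$ denotes the nonsmooth regularizer with proximal map $\mathrm{prox}_{\alpha g}$, and $\alpha > 0$ is a constant step size. In the standard form of Tseng's method, one FBF iteration reads
\[
  \bar{w}_k = \mathrm{prox}_{\alpha g}\bigl(w_k - \alpha F(w_k)\bigr),
  \qquad
  w_{k+1} = \bar{w}_k + \alpha \bigl(F(w_k) - F(\bar{w}_k)\bigr),
\]
so the second evaluation $F(\bar{w}_k)$ is the correction step. A gradient-recycling variant of OGDA type, given here as one plausible form rather than as the exact scheme analyzed in the paper, replaces the second evaluation by the stored value $F(w_{k-1})$:
\[
  w_{k+1} = \mathrm{prox}_{\alpha g}\bigl(w_k - \alpha\,(2F(w_k) - F(w_{k-1}))\bigr).
\]
Without a regularizer ($g \equiv 0$) the proximal map is the identity and this reduces to the familiar OGDA update $w_{k+1} = w_k - 2\alpha F(w_k) + \alpha F(w_{k-1})$, which needs only one new gradient evaluation per iteration.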
