d72b4d43a73ca14d.tex
1: \begin{abstract}%
2:   Motivated by the training of Generative Adversarial Networks (GANs), we study methods for solving minimax problems with additional nonsmooth regularizers.
3:   We do so by employing \emph{monotone operator} theory, in particular the \emph{Forward-Backward-Forward (FBF)} method, which avoids the known issue of limit cycling by correcting each update by a second gradient evaluation.
4:   Furthermore, we propose a seemingly new scheme which recycles old gradients to mitigate the additional computational cost.
5:   In doing so we rediscover a known method, related to \emph{Optimistic Gradient Descent Ascent (OGDA)}.
6:   For both schemes we prove novel convergence rates for convex-concave minimax problems via a unifying approach. The derived error bounds are in terms of the gap function for the ergodic iterates.
7:   For the deterministic and the stochastic problem we show a convergence rate of $\mathcal{O}(\nicefrac{1}{k})$ and $\mathcal{O}(\nicefrac{1}{\sqrt{k}})$, respectively.
8:   We complement our theoretical results with empirical improvements in the training of Wasserstein GANs on the CIFAR10 dataset.
9: \end{abstract}
10: