1cbd4b5f048ae2e4.tex
1: \begin{abstract}
2:   We study a class of convex-concave saddle-point problems of the form
3:   $\min_x\max_y \langle Kx,y\rangle+f_\calP(x)-h^\ast(y)$ where $K$ is
4:   a linear operator, $f_\calP$ is the sum of a convex function $f$
5:   with a Lipschitz-continuous gradient and the indicator function of a
6:   bounded convex polytope $\calP$, and $h^\ast$ is a convex (possibly
7:   nonsmooth) function. Such problems arise, for example, as
8:   Lagrangian relaxations of various discrete optimization problems. Our
9:   main assumptions are the existence of an efficient {\em linear
10:     minimization oracle} ($\lmo$) for $f_\calP$ and an efficient {\em
11:     proximal map} ($\prox$) for $h^*$, which motivate the solution via
12:   a blend of proximal primal-dual algorithms and Frank-Wolfe
13:   algorithms. When $h^*$ is the indicator function of a linear
14:   constraint and $f$ is quadratic, we show an $O(1/n^2)$
15:   convergence rate on the dual objective, requiring $O(n \log n)$
16:   calls to $\lmo$. If the problem comes from the constrained
17:   optimization problem $\min_{x\in\mathbb
18:     R^d}\{f_\calP(x)\:|\:Ax-b=0\}$, then we additionally obtain an
19:   $O(1/n^2)$ bound on both the primal gap and the infeasibility gap.  In
20:   the most general case, we show an $O(1/n)$ convergence rate of the
21:   primal-dual gap, again requiring $O(n\log n)$ calls to $\lmo$. To the
22:   best of our knowledge, this improves on the known convergence rates
23:   for the considered class of saddle-point problems.  We show
24:   applications to labeling problems frequently appearing in machine
25:   learning and computer vision.
26: \end{abstract}
27: