1: \begin{abstract}
2: We study a class of convex-concave saddle-point problems of the form
3: $\min_x\max_y \langle Kx,y\rangle+f_\calP(x)-h^\ast(y)$ where $K$ is
4: a linear operator, $f_\calP$ is the sum of a convex function $f$
5: with a Lipschitz-continuous gradient and the indicator function of a
6: bounded convex polytope $\calP$, and $h^\ast$ is a convex (possibly
7: nonsmooth) function. Such problem arises, for example, as a
8: Lagrangian relaxation of various discrete optimization problems. Our
9: main assumptions are the existence of an efficient {\em linear
10: minimization oracle} ($\lmo$) for $f_\calP$ and an efficient {\em
11: proximal map} ($\prox$) for $h^*$ which motivate the solution via
12: a blend of proximal primal-dual algorithms and Frank-Wolfe
13: algorithms. In case $h^*$ is the indicator function of a linear
14: constraint and function $f$ is quadratic, we show a $O(1/n^2)$
15: convergence rate on the dual objective, requiring $O(n \log n)$
16: calls of $\lmo$. If the problem comes from the constrained
17: optimization problem $\min_{x\in\mathbb
18: R^d}\{f_\calP(x)\:|\:Ax-b=0\}$ then we additionally get bound
19: $O(1/n^2)$ both on the primal gap and on the infeasibility gap. In
20: the most general case, we show a $O(1/n)$ convergence rate of the
21: primal-dual gap again requiring $O(n\log n)$ calls of $\lmo$. To the
22: best of our knowledge, this improves on the known convergence rates
23: for the considered class of saddle-point problems. We show
24: applications to labeling problems frequently appearing in machine
25: learning and computer vision.
26: \end{abstract}
27: