04c4704765aaab9b.tex
1: \begin{abstract}
2:   We present a \emph{blended conditional gradient} approach for
3:   minimizing a smooth convex function over a polytope $P$, combining
4:   the Frank–Wolfe algorithm (also called conditional gradient) with
5:   gradient-based steps. These steps differ from away steps and pairwise
6:   steps, yet the method still achieves linear convergence for strongly convex
7:   functions, along with good practical performance. Our approach
8:   retains all favorable properties of conditional gradient algorithms,
9:   notably avoidance of projections onto $P$ and maintenance of
10:   iterates as sparse convex combinations of a limited number of
11:   extreme points of $P$.  The algorithm is \emph{lazy}, making use of
12:   inexpensive inexact solutions of the linear programming subproblem that
13:   characterizes the conditional gradient approach.  It decreases
14:   measures of optimality (primal and dual gaps) rapidly, both in the
15:   number of iterations and in wall-clock time, outperforming even the
16:   lazy conditional gradient algorithms of \cite{braun2016lazifying}.
17:   We also present a streamlined version of the algorithm for
18:   the probability simplex.
19: \end{abstract}
20: