\begin{abstract}%
We propose a projection-free conditional gradient-type algorithm for smooth stochastic multi-level composition optimization, where the objective function is a nested composition of $T$ functions and the constraint set is a closed convex set. Our algorithm assumes access to noisy evaluations of the functions and their gradients through a stochastic first-order oracle satisfying certain standard unbiasedness and second-moment assumptions. We show that the numbers of calls to the stochastic first-order oracle and the linear-minimization oracle required by the proposed algorithm to obtain an $\epsilon$-stationary solution are of order $\mathcal{O}_T(\epsilon^{-2})$ and $\mathcal{O}_T(\epsilon^{-3})$, respectively, where $\mathcal{O}_T$ hides constants in $T$. Notably, the dependence of these complexity bounds on $\epsilon$ and on $T$ is separate, in the sense that changing one does not affect the dependence of the bounds on the other. For the case of $T=1$, we also provide a high-probability convergence result that depends poly-logarithmically on the inverse confidence level. Moreover, our algorithm is parameter-free and, unlike the common practice in the analysis of stochastic conditional gradient-type algorithms, does not require mini-batches of any (increasing) order to converge.
%, under standard assumptions (unbiased and bounded second moments) on the stochastic first-order oracle.
%in which the number of calls to the stochastic first-order oracle and the linear-minimization oracle, to obtain an $(\epsilon,\delta)$-stationary solution, are of order $\mathcal{O}(\epsilon^{-2}\log^2(1/\delta))$ and $\mathcal{O}(\epsilon^{-3}\log^3(1/\delta))$.
\end{abstract}