2bd17157159723c5.tex
1: \begin{abstract}
2: A novel algorithm named {\tt Perturbed} {\tt Prox-Preconditioned
3:   SPIDER (3P-SPIDER)} is introduced. It is a stochastic
4: variance-reduced proximal-gradient type algorithm built on {\tt
5:   Stochastic Path Integral Differential EstimatoR} (SPIDER), an
6: algorithm known to achieve near-optimal first-order oracle inequality
7: for nonconvex and nonsmooth optimization. Compared to the vanilla
8: prox-SPIDER, \texttt{3P-SPIDER} uses preconditioned gradient
9: estimators. Preconditioning can either be applied "explicitly" to a
10: gradient estimator or be introduced "implicitly" as in applications to
11: the EM algorithm.  \texttt{3P-SPIDER} also assumes that the
12: preconditioned gradients may (possibly) be not known in closed
13: analytical form and therefore must be approximated which adds an
14: additional degree of perturbation. Studying the convergence in
15: expectation, we show that \texttt{3P-SPIDER} achieves a near-optimal
16: oracle inequality $O(n^{1/2} /\epsilon)$ where $n$ is the number of
17: observations and $\epsilon$ the target precision even when the
18: gradient is estimated by Monte Carlo methods. We illustrate the
19: algorithm on an application to the minimization of a penalized
20: empirical loss.
21: \end{abstract}
22: