\begin{abstract}
  The Expectation Maximization (EM) algorithm is of key importance for
  inference in latent variable models, including mixtures of regressors
  and experts and models with missing observations. This paper introduces a novel EM
  algorithm, called \texttt{SPIDER-EM}, for inference from a training
  set of size $n$, $n \gg 1$. At the core of our algorithm is an
  estimator of the full conditional expectation in the {\sf E}-step,
  adapted from the stochastic path-integrated differential estimator
  ({\tt SPIDER}) technique.  We derive finite-time complexity bounds
  for smooth non-convex likelihoods: we show that, for convergence to an
  $\epsilon$-approximate stationary point, the complexity scales as
  $K_{\operatorname{Opt}}(n,\epsilon) = {\cal O}(\epsilon^{-1})$ and
  $K_{\operatorname{CE}}(n,\epsilon) = n + \sqrt{n}\, {\cal O}(
  \epsilon^{-1})$,
  where $K_{\operatorname{Opt}}(n,\epsilon)$ and
  $K_{\operatorname{CE}}(n,\epsilon)$ are respectively the number of
  {\sf M}-steps and the number of per-sample conditional expectation
  evaluations.  This improves over the state-of-the-art
  algorithms. Numerical results support our findings.
\end{abstract}