\begin{abstract}
The Expectation Maximization (EM) algorithm is of key importance for
inference in latent variable models, including mixtures of regressors
and experts and models with missing observations. This paper introduces
a novel EM algorithm, called \texttt{SPIDER-EM}, for inference from a
training set of size $n$, $n \gg 1$. At the core of our algorithm is an
estimator of the full conditional expectation in the {\sf E}-step,
adapted from the stochastic path-integrated differential estimator
({\tt SPIDER}) technique. We derive finite-time complexity bounds
for smooth non-convex likelihoods: we show that for convergence to an
$\epsilon$-approximate stationary point, the complexity scales as
$K_{\operatorname{Opt}}(n,\epsilon) = {\cal O}(\epsilon^{-1})$ and
$K_{\operatorname{CE}}(n,\epsilon) = n + \sqrt{n} \, {\cal O}(\epsilon^{-1})$,
where $K_{\operatorname{Opt}}(n,\epsilon)$ and
$K_{\operatorname{CE}}(n,\epsilon)$ are respectively the number of
{\sf M}-steps and the number of per-sample conditional expectation
evaluations. This improves over the state-of-the-art
algorithms. Numerical results support our findings.
\end{abstract}
