\begin{abstract}
  The Expectation Maximization (EM) algorithm is of key importance for
  inference in latent variable models, including mixtures of regressors
  and experts and models with missing observations. This paper
  introduces a novel EM algorithm, called \texttt{SPIDER-EM}, for
  inference from a training set of size $n$, $n \gg 1$. At the core of
  our algorithm is an estimator of the full conditional expectation in
  the {\sf E}-step, adapted from the stochastic path-integrated
  differential estimator ({\tt SPIDER}) technique. We derive
  finite-time complexity bounds for smooth non-convex likelihoods: we
  show that, for convergence to an $\epsilon$-approximate stationary
  point, the complexity scales as
  $K_{\operatorname{Opt}}(n,\epsilon) = {\cal O}(\epsilon^{-1})$ and
  $K_{\operatorname{CE}}(n,\epsilon) = n + \sqrt{n}\, {\cal O}(\epsilon^{-1})$,
  where $K_{\operatorname{Opt}}(n,\epsilon)$ and
  $K_{\operatorname{CE}}(n,\epsilon)$ are respectively the number of
  {\sf M}-steps and the number of per-sample conditional expectation
  evaluations. These bounds improve over those of state-of-the-art
  algorithms. Numerical results support our findings.
\end{abstract}