9b2e247070484599.tex
1: \begin{abstract}
2:   We study a class of weakly identifiable location-scale mixture
3:   models for which the maximum likelihood estimates based on $n$
4:   i.i.d. samples are known to have lower accuracy than the classical
5:   $n^{- \frac{1}{2}}$ error.  We investigate whether the
6:   Expectation-Maximization (EM) algorithm also converges slowly for
7:   these models.  We first demonstrate via simulation studies a broad
8:   range of over-specified mixture models for which the EM algorithm
9:   converges very slowly, both in one and higher dimensions.  We
10:   provide a complete analytical characterization of this behavior for
11:   fitting data generated from a multivariate standard normal
12:   distribution using two-component Gaussian mixture with varying
13:   location and scale parameters.  Our results reveal distinct regimes
14:   in the convergence behavior of EM as a function of the dimension
15:   $d$. In the multivariate setting ($d \geq 2$), when the covariance
16:   matrix is constrained to a multiple of the identity matrix, the EM
17:   algorithm converges in order $(n/d)^{\frac{1}{2}}$ steps and returns
18:   estimates that are at a Euclidean distance of order ${(n/d)^{-
19:       \frac{1}{4}}}$ and ${ (n d)^{- \frac{1}{2}}}$ from the true
20:   location and scale parameter respectively.  On the other hand, in
21:   the univariate setting ($d = 1$), the EM algorithm converges in
22:   order $n^{\frac{3}{4} }$ steps and returns estimates that are at a
23:   Euclidean distance of order ${ n^{- \frac{1}{8}}}$ and ${ n^{-
24:       \frac{1} {4}}}$ from the true location and scale parameter
25:   respectively.  Establishing the slow rates in the univariate setting
26:   requires a novel localization argument with two stages, with each
27:   stage involving an epoch-based argument applied to a different
28:   surrogate EM operator at the population level.  We also show
29:   multivariate ($d \geq 2$) examples, involving more general
30:   covariance matrices, that exhibit the same slow rates as the
31:   univariate case.
32: \end{abstract}
33: