1: \begin{abstract}
2: We study a class of weakly identifiable location-scale mixture
3: models for which the maximum likelihood estimates based on $n$
4: i.i.d. samples are known to have lower accuracy than the classical
5: $n^{- \frac{1}{2}}$ error. We investigate whether the
6: Expectation-Maximization (EM) algorithm also converges slowly for
7: these models. We first demonstrate via simulation studies a broad
8: range of over-specified mixture models for which the EM algorithm
9: converges very slowly, both in one and higher dimensions. We
10: provide a complete analytical characterization of this behavior for
11: fitting data generated from a multivariate standard normal
12: distribution using two-component Gaussian mixture with varying
13: location and scale parameters. Our results reveal distinct regimes
14: in the convergence behavior of EM as a function of the dimension
15: $d$. In the multivariate setting ($d \geq 2$), when the covariance
16: matrix is constrained to a multiple of the identity matrix, the EM
17: algorithm converges in order $(n/d)^{\frac{1}{2}}$ steps and returns
18: estimates that are at a Euclidean distance of order ${(n/d)^{-
19: \frac{1}{4}}}$ and ${ (n d)^{- \frac{1}{2}}}$ from the true
20: location and scale parameter respectively. On the other hand, in
21: the univariate setting ($d = 1$), the EM algorithm converges in
22: order $n^{\frac{3}{4} }$ steps and returns estimates that are at a
23: Euclidean distance of order ${ n^{- \frac{1}{8}}}$ and ${ n^{-
24: \frac{1} {4}}}$ from the true location and scale parameter
25: respectively. Establishing the slow rates in the univariate setting
26: requires a novel localization argument with two stages, with each
27: stage involving an epoch-based argument applied to a different
28: surrogate EM operator at the population level. We also show
29: multivariate ($d \geq 2$) examples, involving more general
30: covariance matrices, that exhibit the same slow rates as the
31: univariate case.
32: \end{abstract}
33: