\begin{abstract}
  We study a class of weakly identifiable location-scale mixture
  models for which the maximum likelihood estimates based on $n$
  i.i.d. samples are known to have lower accuracy than the classical
  $n^{-\frac{1}{2}}$ error.  We investigate whether the
  Expectation-Maximization (EM) algorithm also converges slowly for
  these models.  We first demonstrate via simulation studies a broad
  range of over-specified mixture models for which the EM algorithm
  converges very slowly, both in one and in higher dimensions.  We
  then provide a complete analytical characterization of this behavior
  for fitting data generated from a multivariate standard normal
  distribution using a two-component Gaussian mixture with varying
  location and scale parameters.  Our results reveal distinct regimes
  in the convergence behavior of EM as a function of the dimension
  $d$.  In the multivariate setting ($d \geq 2$), when the covariance
  matrix is constrained to be a multiple of the identity matrix, the EM
  algorithm converges in order $(n/d)^{\frac{1}{2}}$ steps and returns
  estimates that are at a Euclidean distance of order
  $(n/d)^{-\frac{1}{4}}$ and $(nd)^{-\frac{1}{2}}$ from the true
  location and scale parameters, respectively.  On the other hand, in
  the univariate setting ($d = 1$), the EM algorithm converges in
  order $n^{\frac{3}{4}}$ steps and returns estimates that are at a
  Euclidean distance of order $n^{-\frac{1}{8}}$ and
  $n^{-\frac{1}{4}}$ from the true location and scale parameters,
  respectively.  Establishing the slow rates in the univariate setting
  requires a novel two-stage localization argument, with each
  stage involving an epoch-based argument applied to a different
  surrogate EM operator at the population level.  We also show
  multivariate ($d \geq 2$) examples, involving more general
  covariance matrices, that exhibit the same slow rates as the
  univariate case.
\end{abstract}
