1: \begin{abstract}
2: The Expectation-Maximization (EM) algorithm
3: \citep{demp:lair:rubi:1977} is a popular method for computing
4: maximum likelihood estimates (MLEs) in problems with missing data.
5: Each iteration of the algorithm formally consists of an E-step:
6: evaluate the expected complete-data log-likelihood given the
7: observed data, with expectation taken at current parameter estimate;
8: and an M-step: maximize the resulting expression to find the updated
9: estimate. Conditions that guarantee convergence of the EM sequence
10: to a unique MLE were found by \citet{boyl:1983} and \citet{wu:1983}.
11: In complicated models for high-dimensional data, it is common to
12: encounter an intractable integral in the E-step. The Monte Carlo EM
13: algorithm of \citet{wei:tann:1990} works around this difficulty by
14: maximizing instead a Monte Carlo approximation to the appropriate
15: conditional expectation. Convergence properties of Monte Carlo EM
16: have been studied, most notably, by \citet{chan:ledo:1995} and
17: \citet{fort:moul:2003a}.
18:
19: The goal of this review paper is to provide an accessible but
20: rigorous introduction to the convergence properties of EM and Monte
21: Carlo EM. No previous knowledge of the EM algorithm is assumed. We
22: demonstrate the implementation of EM and Monte Carlo EM in two
23: simple but realistic examples. We show that if the EM algorithm
24: converges it converges to a stationary point of the likelihood, and
25: that the rate of convergence is linear at best. For Monte Carlo EM
26: we present a readable proof of the main result of
27: \citet{chan:ledo:1995}, and state without proof the conclusions of
28: \citet{fort:moul:2003a}. An important practical implication of
29: \pcite{fort:moul:2003a} result relates to the determination of Monte
30: Carlo sample sizes in MCEM; we provide a brief review of the
31: literature \citep{boot:hobe:1999, caff:jank:jone:2005} on that
32: problem.
33: \end{abstract}