a286e34715ef9727.tex
1: \begin{abstract}
2:  Many statistical estimators are defined as the fixed point of a
3:   data-dependent operator, with estimators based on minimizing a
4:   cost function being an important special case.  The limiting
5:   performance of such estimators depends on the properties of the
6:   population-level operator in the idealized limit of infinitely many
7:   samples.  We develop a general framework that yields bounds on
8:   statistical accuracy based on the interplay between the
9:   deterministic convergence rate of the algorithm at the population
10:   level, and its degree of (in)stability when applied to an empirical
11:   object based on $n$ samples.  Using this framework, we analyze both
12:   stable forms of gradient descent and some higher-order and unstable
13:   algorithms, including Newton's method and its cubic-regularized
14:   variant, as well as the EM algorithm. We provide applications of our
15:   general results to several concrete classes of models, including
16:   Gaussian mixture estimation, non-linear regression models, and informative
17:   non-response models.  We exhibit cases in which an unstable
18:   algorithm can achieve the same statistical accuracy as a stable
19:   algorithm in exponentially fewer steps---namely, with the number of
20:   iterations being reduced from polynomial to logarithmic in sample
21:   size $n$.
22: \end{abstract}
23: