abstract:c1f07aa4469ade66.tex

1: \begin{abstract}

2: In finite mixture models, apart from underlying mixing measure, true kernel density function

3: of each subpopulation in the data is, in many scenarios, unknown. Perhaps the most popular

4: approach is to choose some kernel functions that we empirically believe our data are

5: generated from and use these kernels to fit our models. Nevertheless, as long as the

6: chosen kernel and the true kernel are different, statistical inference of mixing measure

7: under this setting will be highly unstable. To overcome this challenge, we propose flexible

8: and efficient robust estimators of the mixing measure in these models, which are inspired

9: by the idea of minimum Hellinger distance estimator, model selection criteria, and

10: superefficiency phenomenon. We demonstrate that our estimators consistently recover the

11: true number of components and achieve the optimal convergence rates of parameter

12: estimation under both the well- and mis-specified kernel settings for any fixed bandwidth.

13: These desirable asymptotic properties are illustrated via careful simulation studies with

14: both synthetic and real data.

15: \footnote{This research is supported in part by grants

16: NSF CCF-1115769, NSF CAREER DMS-1351362, and NSF CNS-1409303 to XN. YR gratefully acknowledges the partial support from NSF DMS-1712962.}

17:

18: AMS 2000 subject classification: Primary 62F15, 62G05; secondary 62G20.

19:

20: Keywords and phrases: model misspecification, convergence rates, mixture models, Fisher singularities, strong identifiability, minimum distance estimator, model selection, superefficiency, Wasserstein distances.

21: \end{abstract}

22: