1: \begin{abstract}
2: In finite mixture models, apart from underlying mixing measure, true kernel density function
3: of each subpopulation in the data is, in many scenarios, unknown. Perhaps the most popular
4: approach is to choose some kernel functions that we empirically believe our data are
5: generated from and use these kernels to fit our models. Nevertheless, as long as the
6: chosen kernel and the true kernel are different, statistical inference of mixing measure
7: under this setting will be highly unstable. To overcome this challenge, we propose flexible
8: and efficient robust estimators of the mixing measure in these models, which are inspired
9: by the idea of minimum Hellinger distance estimator, model selection criteria, and
10: superefficiency phenomenon. We demonstrate that our estimators consistently recover the
11: true number of components and achieve the optimal convergence rates of parameter
12: estimation under both the well- and mis-specified kernel settings for any fixed bandwidth.
13: These desirable asymptotic properties are illustrated via careful simulation studies with
14: both synthetic and real data.
15: \footnote{This research is supported in part by grants
16: NSF CCF-1115769, NSF CAREER DMS-1351362, and NSF CNS-1409303 to XN. YR gratefully acknowledges the partial support from NSF DMS-1712962.}
17:
18: AMS 2000 subject classification: Primary 62F15, 62G05; secondary 62G20.
19:
20: Keywords and phrases: model misspecification, convergence rates, mixture models, Fisher singularities, strong identifiability, minimum distance estimator, model selection, superefficiency, Wasserstein distances.
21: \end{abstract}
22: