d7985571e2517246.tex
1: \begin{abstract}
2: Many existing covariate shift adaptation methods estimate sample weights to be used in the risk estimation in order to mitigate the gap between the source and the target distribution.
3: However, non-parametrically estimating the optimal weights typically involves computationally expensive hyper-parameter tuning that is crucial to the final performance.
4: In this paper, we propose a new non-parametric approach to covariate shift adaptation which avoids estimating weights and has no hyper-parameter to be tuned.
5: Our basic idea is to label unlabeled target data according to the $k$-nearest neighbors in the source dataset.
6: Our analysis indicates that setting $k = 1$ is an optimal choice.
7: Thanks to this property, there is no need to tune any hyper-parameters, unlike other non-parametric methods.
8: Moreover, our method achieves a running time quasi-linear in the sample size with a theoretical guarantee, for the first time in the literature to the best of our knowledge.
9: Our results include sharp rates of convergence for estimating the joint probability distribution of the target data.
10: In particular, the variance of our estimators has the same rate of convergence as for standard parametric estimation despite their non-parametric nature.
11: Our numerical experiments show that proposed method brings drastic reduction in the running time with accuracy comparable to that of the state-of-the-art methods.
12: \end{abstract}
13: