abstract:d7985571e2517246.tex

1: \begin{abstract}

2: Many existing covariate shift adaptation methods estimate sample weights to be used in the risk estimation in order to mitigate the gap between the source and the target distribution.

3: However, non-parametrically estimating the optimal weights typically involves computationally expensive hyper-parameter tuning that is crucial to the final performance.

4: In this paper, we propose a new non-parametric approach to covariate shift adaptation which avoids estimating weights and has no hyper-parameter to be tuned.

5: Our basic idea is to label unlabeled target data according to the $k$-nearest neighbors in the source dataset.

6: Our analysis indicates that setting $k = 1$ is an optimal choice.

7: Thanks to this property, there is no need to tune any hyper-parameters, unlike other non-parametric methods.

8: Moreover, our method achieves a running time quasi-linear in the sample size with a theoretical guarantee, for the first time in the literature to the best of our knowledge.

9: Our results include sharp rates of convergence for estimating the joint probability distribution of the target data.

10: In particular, the variance of our estimators has the same rate of convergence as for standard parametric estimation despite their non-parametric nature.

11: Our numerical experiments show that proposed method brings drastic reduction in the running time with accuracy comparable to that of the state-of-the-art methods.

12: \end{abstract}

13: