8a74a99251acd325.tex
1: \begin{abstract}
2: The distribution regression problem encompasses many important statistical and machine learning tasks,
3: and arises in a wide range of applications.
4: %
5: Among the various existing approaches to this problem, kernel methods have become a method of choice.
6: %
7: Indeed, kernel distribution regression is both computationally favorable and supported by recent learning theory.
8: %
9: This theory also tackles the two-stage sampling setting, where only samples from the input distributions are available.
10: %
11: In this paper, we improve the learning theory of kernel distribution regression. 
12: %
13: We address kernels based on Hilbertian embeddings, which encompass most, if not all, of the existing approaches.
14: %
15: We introduce a novel near-unbiased condition on the Hilbertian embeddings which, together with a new analysis, enables us to provide new error bounds on the effect of two-stage sampling.
16: %
17: We show that this near-unbiased condition holds for three important classes of kernels, based on optimal transport and mean embeddings.
18: %
19: As a consequence, we strictly improve the existing convergence rates for these kernels.
20: %
21: Our setting and results are illustrated by numerical experiments.
22: \end{abstract}
23: