\begin{abstract}
The distribution regression problem encompasses many important statistical and machine learning tasks,
and arises in a wide range of applications.
%
Among the existing approaches to this problem, kernel methods have become a method of choice.
%
Indeed, kernel distribution regression is both computationally favorable and supported by recent learning theory.
%
This theory also covers the two-stage sampling setting, where only samples from the input distributions are available.
%
In this paper, we improve the learning theory of kernel distribution regression.
%
We address kernels based on Hilbertian embeddings, which encompass most, if not all, of the existing approaches.
%
We introduce a novel near-unbiased condition on the Hilbertian embeddings, which enables us to provide new error bounds on the effect of two-stage sampling, thanks to a new analysis.
%
We show that this near-unbiased condition holds for three important classes of kernels, based on optimal transport and mean embedding.
%
As a consequence, we strictly improve the existing convergence rates for these kernels.
%
Our setting and results are illustrated by numerical experiments.
\end{abstract}