56b28992a0b45714.tex
1: \begin{abstract}
2: %This work is concerned with the study of theoretical properties of 
3: Deep Gaussian processes 
4: have recently been  proposed as natural objects to fit, similarly to deep neural networks, possibly complex features present %yet frequently encountered 
5:  in modern data samples, such as compositional structures. 
6: Adopting a Bayesian nonparametric approach, it is natural to use deep Gaussian processes as prior distributions, and  use the corresponding posterior distributions for statistical inference.  We introduce the deep
7: Horseshoe Gaussian process   \textsf{Deep--HGP}, a new simple prior based on deep Gaussian processes with a squared-exponential kernel, that in particular enables data-driven choices of the key lengthscale parameters. For nonparametric regression with random design, we show that the associated tempered posterior distribution recovers the unknown true regression curve optimally in terms of  quadratic loss, up to a logarithmic factor, in an adaptive way. The convergence rates are {\em simultaneously} adaptive to both the smoothness of the regression function and to its structure in terms of compositions. The dependence of the rates in terms of dimension are explicit, allowing in particular for  input spaces of dimension increasing with the number of observations.
8: %Further, the \textsf{Deep--HGP} automatically adapts to the underlying compositional structure, even in the case of a high-dimensional input space.  At the same time,   \textsf{Deep--HGP} are conceptually quite simple to construct. One main idea is that the horseshoe prior enables {\em simultaneous} adaptation to both smoothness {\em and}  structure. 
9: \end{abstract}
10: