6058e51bdc09a886.tex
1: \begin{abstract}  We prove statistical rates of convergence for kernel-based
2: least squares regression from i.i.d. data using a conjugate gradient algorithm,
3: where regularization against overfitting is obtained by early stopping.
4: This method is related to Kernel Partial Least Squares, a
5:  regression method that combines supervised dimensionality reduction with least squares projection.
6:  Following the setting introduced in earlier related literature,
7:  we study so-called ``fast convergence rates'' depending
8:  on the regularity of the target regression function (measured by a source condition
9:  in terms of the kernel integral operator) and
10:  on the effective dimensionality of the data
11:  mapped into the kernel space. We obtain upper bounds,
12:  essentially matching known minimax lower bounds, for the $\cL^2$ (prediction) norm
13:  as well as for the stronger Hilbert norm, if  the
14:  true regression function belongs to the reproducing kernel Hilbert space.
15:  If the latter assumption is not fulfilled, we obtain similar convergence rates
16:  for appropriate norms, provided additional unlabeled data are available.
17: \end{abstract}
18: