1: \begin{abstract} We prove statistical rates of convergence for kernel-based
2: least squares regression from i.i.d. data using a conjugate gradient algorithm,
3: where regularization against overfitting is obtained by early stopping.
4: This method is related to Kernel Partial Least Squares, a
5: regression method that combines supervised dimensionality reduction with least squares projection.
6: Following the setting introduced in earlier related literature,
7: we study so-called ``fast convergence rates'' depending
8: on the regularity of the target regression function (measured by a source condition
9: in terms of the kernel integral operator) and
10: on the effective dimensionality of the data
11: mapped into the kernel space. We obtain upper bounds,
12: essentially matching known minimax lower bounds, for the $\cL^2$ (prediction) norm
13: as well as for the stronger Hilbert norm, if the
14: true regression function belongs to the reproducing kernel Hilbert space.
15: If the latter assumption is not fulfilled, we obtain similar convergence rates
16: for appropriate norms, provided additional unlabeled data are available.
17: \end{abstract}
18: