abstract:c227277daea34fff.tex

1: \begin{abstract}

2: Gaussian processes (GP) are a well studied Bayesian approach for the optimization of black-box functions. Despite their effectiveness in simple problems, GP-based algorithms hardly scale to high-dimensional functions, as their per-iteration time and space cost is at least \emph{quadratic} in the number of dimensions $d$ and iterations~$t$. Given a set of $\Narm$ alternatives to choose from, the overall runtime $\bigotime(t^3\Narm)$ is prohibitive.

3: In this paper we introduce \bkb (\textit{budgeted kernelized bandit}), a new approximate GP algorithm for optimization under bandit feedback that achieves near-optimal regret (and hence near-optimal convergence rate) with near-constant per-iteration complexity and remarkably no assumption on the input space or covariance of the GP.

4:

5: We combine a kernelized linear bandit algorithm (GP-UCB) with randomized matrix sketching based on leverage score sampling, and we prove that randomly sampling inducing points based on their posterior variance gives an accurate low-rank approximation of the GP, preserving variance estimates and confidence intervals. As a consequence, \bkb does not suffer from \emph{variance starvation}, an important problem faced by many previous sparse GP approximations.

6: Moreover, we show that our procedure

7: 	selects at most $\wt{\bigotime}(\deff)$ points, where $\deff$

8: 	is the \emph{effective} dimension of the explored space, which is typically much

9: 	smaller than both $d$ and $t$.

10: 	This greatly reduces the dimensionality of the

11: 	problem, thus leading to a $\bigotime(T\Narm\deff^2)$ runtime and $\bigotime(

12:     \Narm \deff)$ space complexity.

13: \end{abstract}

14: