1: \begin{abstract}
2: Gaussian processes (GP) are a well studied Bayesian approach for the optimization of black-box functions. Despite their effectiveness in simple problems, GP-based algorithms hardly scale to high-dimensional functions, as their per-iteration time and space cost is at least \emph{quadratic} in the number of dimensions $d$ and iterations~$t$. Given a set of $\Narm$ alternatives to choose from, the overall runtime $\bigotime(t^3\Narm)$ is prohibitive.
3: In this paper we introduce \bkb (\textit{budgeted kernelized bandit}), a new approximate GP algorithm for optimization under bandit feedback that achieves near-optimal regret (and hence near-optimal convergence rate) with near-constant per-iteration complexity and remarkably no assumption on the input space or covariance of the GP.
4:
5: We combine a kernelized linear bandit algorithm (GP-UCB) with randomized matrix sketching based on leverage score sampling, and we prove that randomly sampling inducing points based on their posterior variance gives an accurate low-rank approximation of the GP, preserving variance estimates and confidence intervals. As a consequence, \bkb does not suffer from \emph{variance starvation}, an important problem faced by many previous sparse GP approximations.
6: Moreover, we show that our procedure
7: selects at most $\wt{\bigotime}(\deff)$ points, where $\deff$
8: is the \emph{effective} dimension of the explored space, which is typically much
9: smaller than both $d$ and $t$.
10: This greatly reduces the dimensionality of the
11: problem, thus leading to a $\bigotime(T\Narm\deff^2)$ runtime and $\bigotime(
12: \Narm \deff)$ space complexity.
13: \end{abstract}
14: