555b99c0ddd19595.tex
1: \begin{abstract}
2:   Despite advances in scalable models, the inference tools used for Gaussian processes (GPs) have yet to fully capitalize on developments in computing hardware.
3:   We present an efficient and general approach to GP inference based on Blackbox Matrix-Matrix multiplication (BBMM).
4:   BBMM inference uses a modified \emph{batched} version of the conjugate gradients algorithm to derive all terms required for training and inference in a single call.
5:   This reduces the asymptotic complexity of exact GP inference from $\bigo{n^3}$ to $\bigo{n^2}$.
6:   Adapting this algorithm to complex GP models simply requires a routine for efficient matrix-matrix multiplication with the kernel and its derivative.
7:   In addition, BBMM uses a specialized preconditioner to substantially speed up convergence.
8:   In experiments we show that BBMM effectively uses GPU acceleration --
9:   exact GP inference and scalable approximations are \emph{up to $20$ times faster} than existing methods.
10:   Additionally, we provide \emph{GPyTorch}, a software platform for scalable GP inference via BBMM, built on PyTorch.
11: \end{abstract}
12: