abstract:138aa3ae40db7b72.tex

1: \begin{abstract}

2: Bayesian optimisation is a popular approach for optimising expensive

3: black-box functions. The next location to be evaluated is selected via

4: maximising an acquisition function that balances exploitation and exploration.

5: Gaussian processes, the surrogate models of choice in Bayesian optimisation,

6: are often used with a constant prior mean function equal to the arithmetic mean

7: of the observed function values. We show that the rate of convergence can

8: depend sensitively on the choice of mean function.  We empirically investigate

9: 8 mean functions (constant functions equal to the arithmetic mean,

10: minimum, median and maximum of the observed function evaluations, linear,

11: quadratic polynomials, random forests and RBF networks), using 10 synthetic

12: test problems and two real-world problems, and using the Expected Improvement

13: and Upper Confidence Bound acquisition functions.

14:

15: We find that for design dimensions $\ge5$ using a constant mean function equal

16: to the worst observed quality value is consistently the best choice on the

17: synthetic problems considered. We argue that this worst-observed-quality

18: function promotes exploitation leading to more rapid convergence. However, for

19: the real-world tasks the more complex mean functions capable of modelling the

20: fitness landscape may be effective, although there is no clearly optimum

21: choice.

22: \end{abstract}

23: