138aa3ae40db7b72.tex
1: \begin{abstract}
2: Bayesian optimisation is a popular approach for optimising expensive 
3: black-box functions. The next location to be evaluated is selected via
4: maximising an acquisition function that balances exploitation and exploration.
5: Gaussian processes, the surrogate models of choice in Bayesian optimisation, 
6: are often used with a constant prior mean function equal to the arithmetic mean
7: of the observed function values. We show that the rate of convergence can
8: depend sensitively on the choice of mean function.  We empirically investigate
9: 8 mean functions (constant functions equal to the arithmetic mean,
10: minimum, median and maximum of the observed function evaluations, linear, 
11: quadratic polynomials, random forests and RBF networks), using 10 synthetic 
12: test problems and two real-world problems, and using the Expected Improvement
13: and Upper Confidence Bound acquisition functions.
14: 
15: We find that for design dimensions $\ge5$ using a constant mean function equal 
16: to the worst observed quality value is consistently the best choice on the 
17: synthetic problems considered. We argue that this worst-observed-quality 
18: function promotes exploitation leading to more rapid convergence. However, for
19: the real-world tasks the more complex mean functions capable of modelling the
20: fitness landscape may be effective, although there is no clearly optimum
21: choice.
22: \end{abstract}
23: