a8973d07bd3aa9d9.tex
1: \begin{abstract}
2: Many real-world functions are defined over both categorical and \textit{category-specific}
3: continuous variables and thus cannot be optimized by traditional Bayesian
4: optimization (BO) methods. To optimize such functions, we propose
5: a new method that formulates the problem as a multi-armed bandit problem,
6: wherein each category corresponds to an arm with its reward distribution
7: centered around the optimum of the objective function in continuous
8: variables. Our goal is to identify the best arm and the maximizer
9: of the corresponding continuous function simultaneously. Our algorithm
10: uses a Thompson sampling scheme that helps connecting both multi-arm
11: bandit and BO in a unified framework. We extend our method to \textit{batch}
12: BO to allow parallel optimization when multiple resources are available.
13: We theoretically analyze our method for convergence and prove sub-linear
14: regret bounds. We perform a variety of experiments: optimization of
15: several benchmark functions, hyper-parameter tuning of a neural network,
16: and automatic selection of the best machine learning model along with
17: its optimal hyper-parameters (a.k.a \textit{automated machine learning}).
18: Comparisons with other methods demonstrate the effectiveness of our
19: proposed method.
20: \end{abstract}