1: \begin{abstract}
2: The performance of deep (reinforcement) learning systems crucially
3: depends on the choice of hyperparameters. Their tuning is notoriously
4: expensive, typically requiring an iterative training process to run
5: for numerous steps to convergence. Traditional tuning algorithms only
6: consider the final performance of hyperparameters acquired after many
7: expensive iterations and ignore intermediate information from earlier
8: training steps. In this paper, we present a Bayesian optimization (BO)
9: approach which exploits the iterative structure of learning algorithms
10: for efficient hyperparameter tuning. We propose to learn an evaluation
11: function compressing learning progress at any stage of the training
12: process into a single numeric score according to both training success
13: and stability. Our BO framework is then balancing the benefit of assessing
14: a hyperparameter setting over additional training steps against their
15: computation cost. We further increase model efficiency by selectively
16: including scores from different training steps for any evaluated hyperparameter
17: set. We demonstrate the efficiency of our algorithm by tuning hyperparameters
18: for the training of deep reinforcement learning agents and convolutional
19: neural networks. Our algorithm outperforms all existing baselines
20: in identifying optimal hyperparameters in minimal time.
21:
22: \end{abstract}
23: