abstract:ced70e0fffb7980d.tex

1: \begin{abstract}

2: The performance of deep (reinforcement) learning systems crucially

3: depends on the choice of hyperparameters. Their tuning is notoriously

4: expensive, typically requiring an iterative training process to run

5: for numerous steps to convergence. Traditional tuning algorithms only

6: consider the final performance of hyperparameters acquired after many

7: expensive iterations and ignore intermediate information from earlier

8: training steps. In this paper, we present a Bayesian optimization (BO)

9: approach which exploits the iterative structure of learning algorithms

10: for efficient hyperparameter tuning. We propose to learn an evaluation

11: function compressing learning progress at any stage of the training

12: process into a single numeric score according to both training success

13: and stability. Our BO framework is then balancing the benefit of assessing

14: a hyperparameter setting over additional training steps against their

15: computation cost. We further increase model efficiency by selectively

16: including scores from different training steps for any evaluated hyperparameter

17: set. We demonstrate the efficiency of our algorithm by tuning hyperparameters

18: for the training of deep reinforcement learning agents and convolutional

19: neural networks. Our algorithm outperforms all existing baselines

20: in identifying optimal hyperparameters in minimal time.

21:

22: \end{abstract}

23: