
\begin{abstract}

We present a new multilevel minimization framework for the training of deep residual networks (ResNets), which has the potential to significantly reduce training time and effort.

Our framework is based on the dynamical systems viewpoint, which formulates a ResNet as the discretization of an initial value problem.

The training process is then formulated as a time-dependent optimal control problem, which we discretize using different time-discretization parameters, thereby generating a multilevel hierarchy of auxiliary networks with different resolutions.

The training of the original ResNet is then enhanced by training the auxiliary networks with reduced resolutions.

By design, our framework is independent of the training strategy chosen on each level of the multilevel hierarchy.

By means of numerical examples, we analyze the convergence behavior of the proposed method and demonstrate its robustness.

For our examples, we employ multilevel gradient-based methods.

Comparisons with standard single-level methods show a speedup of more than a factor of three while achieving the same validation accuracy.

\end{abstract}
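
To make the dynamical systems viewpoint concrete, the following is a minimal sketch under assumed notation: the symbols $x_k$, $\theta_k$, $F$, $h$, $T$, and $K$ are illustrative and are not fixed by the abstract, and the forward Euler scheme is one common choice of discretization rather than necessarily the one used here. The continuous initial value problem for the features $x(t)$ with parameters $\theta(t)$ may be written as
\[
  \dot{x}(t) = F\bigl(x(t), \theta(t)\bigr), \quad t \in (0, T], \qquad x(0) = x_0,
\]
and a forward Euler discretization with $K$ steps of size $h = T/K$ gives the familiar residual update
\[
  x_{k+1} = x_k + h \, F(x_k, \theta_k), \qquad k = 0, \dots, K-1.
\]
Under this reading, a larger step size $h$ (fewer layers $K$ for a fixed $T$) corresponds to a shallower network with reduced resolution, which is one way to picture the auxiliary networks of the multilevel hierarchy.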
