abstract:43c195022a12f731.tex

1: \begin{abstract}

2: We propose a globally convergent multilevel training method for deep residual networks (ResNets).

3: The devised method can be seen as a novel variant of the recursive multilevel trust-region (RMTR) method, which operates in hybrid (stochastic-deterministic) settings by adaptively adjusting mini-batch sizes during the training.

4: The multilevel hierarchy and the transfer operators are constructed by exploiting a dynamical system's viewpoint, which interprets forward propagation through the ResNet as a forward Euler discretization of an initial value problem.

5: In contrast to traditional training approaches, our novel RMTR method also incorporates curvature information on all levels of the multilevel hierarchy by means of the limited-memory SR1 method.

6: The overall performance and the convergence properties of our  multilevel training method are numerically investigated using examples from the field of classification and regression.

7: \end{abstract}

8: