abstract:ae71fbfb482e24e6.tex

1: \begin{abstract}

2: Interpretation of Deep Neural Networks (DNNs) training as an optimal control problem with

3: nonlinear dynamical systems has received considerable attention recently, yet the algorithmic development remains relatively limited.

4: In this work, we make an attempt along this line

5: by reformulating the training procedure from the trajectory optimization perspective.

6: We first show that most widely-used algorithms for training DNNs can be linked to the

7: Differential Dynamic Programming (DDP),

8: a celebrated second-order method

9: rooted in the Approximate Dynamic Programming.

10: In this vein, we propose a new class of optimizer, DDP Neural Optimizer (DDP-NOpt),

11: for training {feedforward} and convolution networks.

12: DDPNOpt features layer-wise feedback policies which improve convergence and

13: reduce sensitivity to hyper-parameter over existing methods.

14: It outperforms other optimal-control inspired training methods in both convergence and complexity,

15: and is competitive against state-of-the-art first and second order methods.

16: We also observe DDPNOpt has surprising benefit in preventing gradient vanishing.

17: Our work opens up new avenues for principled algorithmic design built upon the optimal control theory.

18: \end{abstract}

19: