\begin{abstract}
In this paper, we develop a novel second-order method for training feed-forward neural networks.
At each iteration, we construct a quadratic approximation to the cost function in a low-dimensional subspace.
We minimize this approximation inside a trust region through a two-stage procedure:
first within the embedded positive-curvature subspace, and then with a gradient descent step.
This approach yields fast decay of the objective function, prevents convergence to saddle points, and reduces the need for manual parameter tuning.
We demonstrate the good performance of the proposed algorithm on benchmark datasets.
\end{abstract}
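
The construction summarized in the abstract can be sketched as follows. The notation is assumed here for illustration and is not taken from the abstract. Let $f$ denote the cost function, $w_k$ the weights at iteration $k$, and $V_k$ an $n \times d$ matrix with orthonormal columns spanning the low-dimensional subspace, with $d \ll n$. A quadratic model restricted to this subspace, with trust-region radius $\Delta_k$, could then be written as
\[
m_k(\alpha) = f(w_k) + g_k^{\top} \alpha + \frac{1}{2}\, \alpha^{\top} H_k\, \alpha,
\qquad \|\alpha\|_2 \le \Delta_k,
\]
where
\[
g_k = V_k^{\top} \nabla f(w_k), \qquad H_k = V_k^{\top} \nabla^2 f(w_k)\, V_k .
\]
Given an eigendecomposition $H_k = Q \Lambda Q^{\top}$, the positive-curvature subspace is spanned by the columns of $Q$ whose eigenvalues are positive. Restricted to that subspace the model is strictly convex, so the first stage is a well-posed constrained minimization. The abstract does not specify how the subspace is chosen or how the subsequent gradient descent step is scaled, and this sketch makes no claim about either.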