\begin{abstract}
In this paper, we develop a novel second-order method for training feed-forward neural networks.
At each iteration, we construct a quadratic approximation to the cost function in a low-dimensional subspace.
We minimize this approximation inside a trust region through a two-stage procedure:
first within the embedded positive-curvature subspace, and then through a gradient descent step.
This approach leads to fast decay of the objective function, prevents convergence to saddle points, and reduces the need for manual parameter tuning. We demonstrate the performance of the proposed algorithm on benchmark datasets.
\end{abstract}