dc2ca5cb327d6df7.tex
1: \begin{abstract}
2: 
3: We study stochastic inexact Newton methods and consider their
4: application in nonconvex settings. Building on the work of
5: [R. Bollapragada, R. H. Byrd, and J. Nocedal, IMA Journal of Numerical
6:   Analysis, 39 (2018), pp. 545--578] we derive bounds for convergence rates in
7: expected value for stochastic low rank Newton methods, and
8: stochastic inexact Newton Krylov methods. These bounds
9: quantify the errors incurred in subsampling the Hessian and gradient,
10: as well as in approximating the Newton linear solve, and in choosing
11: regularization and step length parameters. We deploy these methods in
12: training convolutional autoencoders for the MNIST and CIFAR10
13: data sets. Numerical results demonstrate that, relative to first order
14: methods, these stochastic inexact Newton methods often converge
15: faster, are more cost-effective, and generalize better.
16: 
17: 
18: \end{abstract}
19: