abstract:378f53dfa4ba3ba9.tex

1: \begin{abstract}

2:   We propose a new globally convergent stochastic second-order method. Our starting point is the development of a new Sketched Newton-Raphson (\SNR) method for solving large-scale nonlinear equations of the form $F(x)=0$ with $F:\R^p \rightarrow \R^m$.

3:   We then show how to design several stochastic second-order optimization methods by rewriting the optimization problem of interest as a system of nonlinear equations and applying \SNR. For instance, by applying \SNR to find a stationary point of a generalized linear model (GLM), we derive completely new and scalable stochastic second-order methods. We show that the resulting method is very competitive with state-of-the-art variance-reduced methods.

4:   Furthermore, using a variable splitting trick, we also show that the \emph{Stochastic Newton method} (\SNM) is a special case of \SNR, and use this connection to establish the first global convergence theory for \SNM.

5:

6:   We establish the global convergence of \SNR by showing that it is

7: %  Indeed, by showing that \SNR can be interpreted as

8:   a variant of the online stochastic gradient descent (\SGD) method, and then leveraging proof techniques of \SGD.

9:   As a special case, our theory also provides a new global convergence theory for the original Newton-Raphson method under strictly weaker assumptions than the classic monotone convergence theory.

10: %   what is commonly used for global convergence.

11: \end{abstract}

12: