1: \begin{abstract}% <- trailing '%' for backward compatibility of .sty file
2: We study a continuous-time approximation of the stochastic gradient descent process for
3: minimizing the expected loss in learning problems. The main results establish general sufficient
4: conditions for the convergence, extending the results of \cite{C22} established for (nonstochastic)
5: gradient descent. We show how the main result can be applied to the case of overparametrized linear
6: neural network training.
7: \end{abstract}
8: