\begin{abstract}
We develop a mathematically rigorous framework for multilayer neural
networks in the mean field regime. As the network's width increases,
the network's learning trajectory is shown to be well captured by
a meaningful and dynamically nonlinear limit (the \textit{mean field}
limit), which is characterized by a system of ODEs. Our framework
applies to a broad range of network architectures, learning dynamics
and network initializations. Central to the framework is the new idea
of a \textit{neuronal embedding}, which comprises a non-evolving
probability space that allows one to embed neural networks of arbitrary
widths.

We demonstrate two applications of our framework. First, the framework
gives a principled way to study the simplifying effects that independent
and identically distributed initializations have on the mean field
limit. Second, we prove a global convergence guarantee for two-layer
and three-layer networks. Unlike previous works that rely on convexity,
our result requires a certain universal approximation property, which
is a distinctive feature of infinite-width neural networks. To the
best of our knowledge, this is the first time global convergence is
established for neural networks of more than two layers in the mean
field regime.
\end{abstract}
