abstract:be57aeef7e4debe7.tex

1: \begin{abstract}

2: We develop a mathematically rigorous framework for multilayer neural

3: networks in the mean field regime. As the network's width increases,

4: the network's learning trajectory is shown to be well captured by

5: a meaningful and dynamically nonlinear limit (the \textit{mean field}

6: limit), which is characterized by a system of ODEs. Our framework

7: applies to a broad range of network architectures, learning dynamics

8: and network initializations. Central to the framework is the new idea

9: of a \textit{neuronal embedding}, which comprises of a non-evolving

10: probability space that allows to embed neural networks of arbitrary

11: widths.

12:

13: We demonstrate two applications of our framework. Firstly the framework

14: gives a principled way to study the simplifying effects that independent

15: and identically distributed initializations have on the mean field

16: limit. Secondly we prove a global convergence guarantee for two-layer

17: and three-layer networks. Unlike previous works that rely on convexity,

18: our result requires a certain universal approximation property, which

19: is a distinctive feature of infinite-width neural networks. To the

20: best of our knowledge, this is the first time global convergence is

21: established for neural networks of more than two layers in the mean

22: field regime.

23: \end{abstract}

24: