1: \begin{abstract}
2: % We prove tight bounds on the smallest eigenvalue of the NTK for deep ReLU networks.
3: A recent line of work has analyzed the theoretical properties of deep neural networks via the Neural Tangent Kernel (NTK).
4: In particular, the smallest eigenvalue of the NTK has been related to memorization capacity,
5: convergence of gradient descent algorithms and generalization of deep nets.
6: However, existing results either provide bounds in the two-layer setting or assume that the spectrum of the NTK
7: is bounded away from 0 for multi-layer networks.
8: In this paper, we provide tight bounds on the smallest eigenvalue of NTK matrices for deep ReLU networks,
9: both in the limiting case of infinite widths and for finite widths.
10: In the finite-width setting, the network architectures we consider are quite general:
11: we require the existence of a wide layer with roughly order of $N$ neurons,
12: $N$ being the number of data samples; and the scaling of the remaining widths is arbitrary (up to logarithmic factors).
13: To obtain our results, we analyze various quantities of independent interest:
14: we give lower bounds on the smallest singular value of feature matrices,
15: and upper bounds on the Lipschitz constant of input-output feature maps.
16: \end{abstract}
17: