613f54b4661dc3d8.tex
1: \begin{abstract}
2: 	We consider the problem of learning \red{nonlinear dynamical} systems governed by nonlinear state equation $\h_{t+1}=\phi(\h_t,\ub_t;\bteta)+\w_t$. Here $\bteta$ is the unknown system dynamics, $\h_t $ is the state, $\ub_t$ is the input and $\w_t$ is the additive noise vector. We study gradient based algorithms to learn the system dynamics $\bteta$ from samples obtained from a single finite trajectory. \red{If the system is run by a stabilizing input policy, then using a mixing-time argument we show that temporally-dependent samples can be approximated by i.i.d.~samples}. We then develop new guarantees for the uniform convergence of the gradients of the empirical loss \red{induced by these i.i.d.~samples}. Unlike existing works, our bounds are noise sensitive which allows for learning ground-truth dynamics with high accuracy and small sample complexity. Together, our results facilitate efficient learning of \red{a broader class of nonlinear dynamical systems as compared to the prior works}. %under stability and one-point convexity conditions}. 
3: We specialize our guarantees to  entrywise nonlinear activations and verify our theory in various numerical experiments. %which also demonstrate that Linear-Quadratic Regulator robustly stabilizes entrywise nonlinear activations.
4: %\citet{krauth2019finite}
5: \end{abstract}
6: