\begin{abstract}
Deep neural networks are powerful tools for modeling observations over time with non-linear patterns. Despite the widespread use of neural networks in such settings, most theoretical developments for deep neural networks assume independent observations, and theoretical results for temporally dependent observations are scarce. To bridge this gap, we study the theoretical properties of deep neural networks for modeling non-linear time series data. Specifically, non-asymptotic bounds for the prediction error of (sparse) feed-forward neural networks with the ReLU activation function are established under mixing-type assumptions. These assumptions are mild enough to cover a wide range of time series models, including auto-regressive models. Compared to the case of independent observations, the established convergence rates have additional logarithmic factors that compensate for the additional complexity due to dependence among data points. The theoretical results are supported by numerical simulations in various settings as well as an application to a macroeconomic data set.
% non-linear time series observations. is a promising alternative to the traditional statistic methods in time series forecasting. Despite the widespread use
% of neural networks, the theoretical property is barely known in time series setting. Our work proves the convergence rate of neural network with temporal dependent observations and shows that the convergence rate coincides with the rate with independent observations up to a $\mathrm{log}^4n$ factor. The proof is subsequently used to show the convergence property of neural network in fitting auto-regressive model. The results are supported by our simulation.
\end{abstract}