1: \begin{abstract}
2: Deep neural networks are considered to be state of the art models in many offline machine learning tasks. However, their performance and generalization abilities in online learning tasks are much less understood. Therefore, we focus on online learning and tackle the challenging problem where the underlying process is stationary and ergodic and thus removing the i.i.d. assumption and allowing observations to depend on each other arbitrarily.
3:
4: We prove the generalization abilities of Lipschitz regularized deep neural networks and show that by using those networks, a convergence to the best possible prediction strategy is guaranteed.
5:
6: % Online for different time sclaes is a requirment for many online learning tasks. In this paper we focus on the problem of long term prediction where the learner is allowed to switch his prediction to a short time experts but for a paying costs. We present a novel algorithm for combining long term and short term experts and proove that our algorithm has bounded regret from the best long term expert while chossing the best places to switch his prediction to long term experts.
7: % Numerical examples
8: % validate the viability of our method and show significant improvement over the state-of-the-art.
9:
10:
11: \end{abstract}