\begin{abstract}
\noindent
Modern Monte Carlo-type approaches
to dynamic decision problems
face the classical bias-variance trade-off.
Deep neural networks can
overlearn the data and construct
feedback actions which are non-adapted to
the information flow and hence
become susceptible to generalization error.
We prove
asymptotic overlearning for fixed training sets, but also
provide a non-asymptotic upper bound on overperformance
based on the Rademacher complexity,
demonstrating the convergence of these algorithms
for sufficiently large training sets.
Numerically studied
stylized examples illustrate these possibilities, the dependence on the
dimension, and the effectiveness of this approach.
\end{abstract}