1: \begin{abstract}
2: %% Text of abstract
3: This paper introduces Deep Policy Iteration (DPI), a novel approach that integrates the strengths of Neural Networks with the stability and convergence advantages of Policy Iteration (PI) to address high-dimensional stochastic Mean Field Games (MFG). DPI overcomes the limitations of PI, which is constrained by the curse of dimensionality to low-dimensional problems, by iteratively training three neural networks to solve PI equations and satisfy forward-backwards conditions. Our findings indicate that DPI achieves comparable convergence levels to the Mean Field Deep Galerkin Method (MFDGM), with additional advantages. Furthermore, deep learning techniques show promise in handling separable Hamiltonian cases where PI alone is less effective. DPI effectively manages high-dimensional problems, extending the applicability of PI to both separable and non-separable Hamiltonians.
4: % To evaluate the reliability and efficacy of DPI, a series of numerical experiments is conducted.
5: % The results obtained using DPI are compared with those obtained using the MFDGM method and the Policy Iteration Method. This comparative analysis provides insights into the performance of DPI and its advantages over existing methods.
6:
7:
8:
9:
10:
11: \end{abstract}
12: