\begin{abstract}
Entropy regularization is widely used to improve the efficiency, stability, and convergence of reinforcement learning algorithms.
This paper analyzes, both quantitatively and qualitatively, the impact of entropy regularization on Mean Field Games (MFGs) with learning over a finite time horizon. Our study provides a theoretical justification that entropy regularization yields time-dependent policies and, furthermore, helps stabilize and accelerate convergence to the game equilibrium.
In addition, this study leads to a policy-gradient algorithm with exploration for MFGs. With this algorithm, agents are able to learn the optimal exploration schedule, with stable and fast convergence to the game equilibrium.
\end{abstract}