a91f8d3525ff95bf.tex
1: \begin{abstract}
2: Entropy regularization has been extensively adopted   to improve the efficiency, the stability, and the convergence of algorithms in reinforcement learning.
3: This paper analyzes  both quantitatively and qualitatively the impact of entropy regularization for Mean Field Games (MFGs) with learning in a finite time horizon.  Our study provides a theoretical justification that entropy regularization yields time-dependent policies and, furthermore, helps stabilizing and accelerating  convergence to the game equilibrium. 
4: In addition, this study leads to a policy-gradient algorithm with exploration in MFG. With this algorithm,  agents are able to  learn the optimal exploration scheduling, with stable and fast  
5: convergence to the game equilibrium.
6: \end{abstract}
7: