abstract:a91f8d3525ff95bf.tex

1: \begin{abstract}

2: Entropy regularization has been extensively adopted   to improve the efficiency, the stability, and the convergence of algorithms in reinforcement learning.

3: This paper analyzes  both quantitatively and qualitatively the impact of entropy regularization for Mean Field Games (MFGs) with learning in a finite time horizon.  Our study provides a theoretical justification that entropy regularization yields time-dependent policies and, furthermore, helps stabilizing and accelerating  convergence to the game equilibrium.

4: In addition, this study leads to a policy-gradient algorithm with exploration in MFG. With this algorithm,  agents are able to  learn the optimal exploration scheduling, with stable and fast

5: convergence to the game equilibrium.

6: \end{abstract}

7: