e4e883dc1c4f51ec.tex
1: \begin{abstract}
2:   This paper focuses on the active flow control of a computational fluid
3:   dynamics simulation over a range of Reynolds numbers using deep reinforcement
4:   learning (DRL). More precisely, the proximal policy optimization (PPO) method
5:   is used to control the mass flow rate of four synthetic jets symmetrically
6:   located on the upper and lower sides of a cylinder immersed in a
7:   two-dimensional flow domain. The learning environment supports four flow
8:   configurations with Reynolds numbers 100, 200, 300 and 400, respectively. A
9:   new smoothing interpolation function is proposed to help the PPO algorithm to
10:   learn to set continuous actions, which is of great importance to effectively
11:   suppress problematic jumps in lift and allow a better convergence for the
12:   training process. It is shown that the DRL controller is able to significantly
13:   reduce the lift and drag fluctuations and to actively reduce the drag by
14:   approximately 5.7\%, 21.6\%, 32.7\%, and 38.7\%, at $Re$=100, 200, 300, and
15:   400 respectively. More importantly, it can also effectively reduce drag for
16:   any previously unseen value of the Reynolds number between 60 and 400. This
17:   highlights the generalization ability of deep neural networks and is an
18:   important milestone towards the development of practical applications of DRL
19:   to active flow control.
20: \end{abstract}
21: