1: \begin{abstract}
2: This paper proposes an offline control algorithm, called Recurrent Model Predictive Control (RMPC), in order to solve large-scale nonlinear finite-horizon optimal control problems.
3: As an enhancement of traditional Model Predictive Control (MPC) algorithms, it can adaptively select appropriate model prediction horizon according to current computing resources, so as to improve the policy performance. Our algorithm employs a recurrent function to approximate the optimal policy, which maps the system states and reference values directly to the control inputs. The output of the learned policy network after $N$ recurrent cycles corresponds to the nearly optimal solution of $N$-step MPC.
4: %The number of prediction steps is equal to the number of recurrent cycles of the learned policy function. With an arbitrary initial policy, the proposed RMPC algorithm can converge to the optimal policy via directly minimizing the designed loss function.
5: A policy optimization objective is designed by decomposing the MPC cost function according to the Bellman's principle of optimality. The optimal recurrent policy can be obtained by directly minimizing the designed objective function, which is applicable for general nonlinear and non input-affine systems.
6: %We further prove the convergence and optimality of the \textcolor{red}{proposed} RMPC algorithm through the Bellman's principle of optimality.
7: The hardware-in-the-Loop (HIL) experiment is performed to demonstrate its generality and efficiency. Results show that RMPC is over 5 times faster than the traditional MPC algorithm under identical problem scale.
8: \end{abstract}
9: