
\begin{abstract}

This paper proposes a reinforcement learning method informed by time-delayed data, referred to as incremental adaptive dynamic programming, to learn approximate solutions to optimal tracking control problems (OTCPs) of high-dimensional nonlinear systems. Departing from available solutions to OTCPs, the developed tracking control scheme addresses the curse of complexity in value function approximation in a decoupled way, avoids the learning inefficiency caused by varying desired trajectories by not introducing reference trajectory dynamics into the learning process, and uses time-delayed signals so that neither an accurate nor an identified dynamics model is required. Specifically, the intractable OTCP of a high-dimensional uncertain system is first converted into multiple manageable sub-OTCPs of low-dimensional incremental subsystems constructed using time-delayed data. The resulting sub-OTCPs are then approximately solved by a parallel critic learning structure. The proposed tracking control scheme is supported by rigorous theoretical analysis of system stability and weight convergence, and is validated experimentally on a 3-DoF robot manipulator.

\end{abstract}
