84e78eee60f9a957.tex
1: \begin{abstract}
2: 		This paper introduces a novel operator, termed the $\mathcal{Y}$ operator, to elevate control performance in Actor-Critic (AC) based reinforcement learning for systems governed by stochastic differential equations (SDEs). The $\mathcal{Y}$ operator ingeniously integrates the stochasticity of a class of child-mother system into the Critic network’s loss function, yielding substantial advancements in the control performance of RL algorithms. Additionally, the $\mathcal{Y}$ operator elegantly reformulates the challenge of solving partial differential equations for the state-value function into a parallel problem for the drift and diffusion functions within the system's SDEs.  A rigorous mathematical proof confirms the operator's validity. This transformation enables the $\mathcal{Y}$ Operator-based Reinforcement Learning (YORL) framework to efficiently tackle optimal control problems in both model-based and data-driven systems. The superiority of YORL is demonstrated through linear and nonlinear numerical examples, showing its enhanced performance over existing methods post convergence.
3: 	\end{abstract}
4: