
\begin{abstract}

In stochastic systems, risk-sensitive control balances performance with resilience to unlikely events. Whereas existing methods rely on finite-horizon risk criteria, this paper introduces \textit{ergodic-risk criteria} that capture long-term cumulative risks through probabilistic limit theorems. Extending the Linear Quadratic Regulation (LQR) framework, we incorporate constraints on these ergodic-risk criteria, which are derived from the asymptotic behavior of cumulative costs and account for extreme deviations.

Using tailored Functional Central Limit Theorems (FCLTs), we demonstrate that the time-correlated terms in the ergodic-risk criteria converge under strong ergodicity. We also establish conditions for convergence in non-stationary settings, characterize the distribution, and provide explicit formulations for the limiting variance of the risk functional.

The FCLT is developed by applying ergodic theory for Markov chains and establishing \textit{uniform ergodicity} of the controlled process.

For quadratic risk functionals on linear dynamics, uniform ergodicity requires, in addition to internal stability, that the (possibly heavy-tailed) dynamic noise have a finite fourth moment.

This offers a clear path to quantifying long-term uncertainty. We also propose a primal-dual constrained policy optimization method that optimizes average performance while ensuring that the ergodic-risk constraints are satisfied. Our framework offers a practical, theoretically grounded approach to long-term risk-sensitive control, backed by convergence guarantees and validated through simulations.

\end{abstract}
