abstract:36b3386030ae8213.tex

1: \begin{abstract}

2: This paper studies an application at the intersection of learning and control. Suppose we have a set of linear autonomous systems with bounded process noise, but the dynamics of each system are unknown. The goal of the application is to design a policy that stabilizes the system. The underlying question is how to estimate the dynamics of each system given that measurements of each system will be nonsequential. Though seemingly straightforward, existing proof techniques for proving statistical consistency of system identification procedures fail when measurements are nonsequential. Here, we provide an estimator that leverages boundedness of the process noise, and we prove its statistical consistency holds even when measurements are nonsequential. We illustrate the strong consistency (i.e., almost sure convergence) of our estimator by using it to construct a stabilizing policy for the motivating learning-based switched control application.

3: \end{abstract}