abstract:fe43093fd059c8f2.tex

1: \begin{abstract}

2: We study the problem of adaptive control of a high dimensional linear quadratic (LQ) system.

3: Previous work established the asymptotic convergence to an optimal controller for various adaptive control schemes.

4: More recently, for the average cost LQ problem, a regret bound of ${O}(\sqrt{T})$ was shown, apart form logarithmic factors.

5: However, this bound scales exponentially with $p$, the dimension of the state space.

6: In this work we consider the case where the matrices describing the dynamic of the LQ system are sparse and their dimensions are large.

7: We present an adaptive control scheme that achieves a regret bound of ${O}(p \sqrt{T})$, apart from logarithmic factors.

8: In particular, our algorithm has an average cost of $(1+\eps)$ times the optimum cost after $T = \polylog(p) O(1/\eps^2)$.

9: This is in comparison to previous work on the dense dynamics where the algorithm requires time that scales exponentially with dimension in order to achieve regret of $\eps$ times the optimal cost.

10:

11: We believe that our result has prominent applications in the emerging area of computational advertising, in particular targeted online advertising and advertising in social networks.

12: \end{abstract}

13: