d7a3612dc178ee0c.tex
1: \begin{abstract}
2:   We present a technique for learning control Lyapunov-like functions,
3:   which are used in turn to synthesize controllers for nonlinear
4:   dynamical systems that can stabilize the system, or satisfy
5:   specifications such as remaining inside a safe set, or eventually
6:   reaching a target set while remaining inside a safe set.  The
7:   learning framework uses a \emph{demonstrator} that implements a
8:   black-box, untrusted strategy presumed to solve the problem of
9:   interest, a \emph{learner} that poses finitely many queries to the
10:   demonstrator to infer a candidate function, and a \emph{verifier}
11:   that checks whether the current candidate is a valid control
12:   Lyapunov function. The overall learning framework is iterative,
13:   eliminating a set of candidates on each iteration using the
14:   counterexamples discovered by the verifier and the demonstrations
15:   over these counterexamples. We prove its convergence using
16:   ellipsoidal approximation techniques from convex optimization. We
17:   also implement this scheme using nonlinear MPC controllers to serve
18:   as demonstrators for a set of state and trajectory stabilization
19:   problems for nonlinear dynamical systems. We show how the
20:     verifier can be constructed efficiently using convex relaxations
21:     of the verification problem for polynomial systems to
22:     semi-definite programming (SDP) problem instances. Our approach
23:   is able to synthesize relatively simple polynomial control Lyapunov
24:   functions, and in that process replace the MPC using a guaranteed
25:   and computationally less expensive controller.
26:   \keywords{ Lyapunov Functions \and Controller Synthesis \and Learning from Demonstrations \and Concept Learning.}
27: % \PACS{PACS code1 \and PACS code2 \and more}
28: % \subclass{MSC code1 \and MSC code2 \and more}
29: \end{abstract}
30: