1: \begin{abstract}
2: We present a technique for learning control Lyapunov-like functions,
3: which are in turn used to synthesize controllers for nonlinear
4: dynamical systems that stabilize the system or satisfy
5: specifications such as remaining inside a safe set, or eventually
6: reaching a target set while remaining inside a safe set. The
7: learning framework uses a \emph{demonstrator} that implements a
8: black-box, untrusted strategy presumed to solve the problem of
9: interest, a \emph{learner} that poses finitely many queries to the
10: demonstrator to infer a candidate function, and a \emph{verifier}
11: that checks whether the current candidate is a valid control
12: Lyapunov function. The overall learning framework is iterative,
13: eliminating a set of candidates on each iteration using the
14: counterexamples discovered by the verifier and the demonstrations
15: over these counterexamples. We prove its convergence using
16: ellipsoidal approximation techniques from convex optimization. We
17: also implement this scheme using nonlinear MPC controllers to serve
18: as demonstrators for a set of state and trajectory stabilization
19: problems for nonlinear dynamical systems. We show how the
20: verifier can be constructed efficiently by relaxing the
21: verification problem for polynomial systems to semidefinite
22: programming (SDP) instances. Our approach synthesizes relatively
23: simple polynomial control Lyapunov functions and, in the process,
24: replaces the MPC with a formally guaranteed and computationally
25: less expensive controller.
26: \keywords{Lyapunov Functions \and Controller Synthesis \and Learning from Demonstrations \and Concept Learning}
29: \end{abstract}
30: