8ac920be0c719d03.tex
1: \begin{abstract}
2: In this paper, a convex optimization-based method is proposed for numerically solving dynamic programs in continuous state and action spaces.
3: The key idea is to approximate the output of the Bellman operator at a particular state by the optimal value of a convex program.
4: The approximate Bellman operator has a computational advantage because it involves a convex optimization problem in the case of control-affine systems and convex costs. Using this feature, we propose a simple dynamic programming algorithm to evaluate the approximate value function at  pre-specified grid points by solving convex optimization problems in each iteration.
5: We show that the proposed method approximates the optimal value function with a uniform convergence property in the case of convex optimal value functions.
6: We also propose an \emph{interpolation-free} design method for a control policy, of which performance converges uniformly to the optimum as the grid resolution becomes finer.
7: When a nonlinear control-affine system is considered, 
8: the convex optimization approach provides an approximate  policy with a provable suboptimality bound. 
9: For general cases,
10:  the proposed convex formulation of dynamic programming operators can be  modified as a nonconvex bi-level program, in which the inner problem is a linear program, without losing uniform convergence properties.
11: \end{abstract}
12: