abstract:58765c4bfd795695.tex

1: \begin{abstract}

2: {\sloppy

3: In AI research, synthesizing a plan of action has typically used \textit{descriptive models} of the actions that abstractly specify \textit{what} might happen as a result of an action, and are tailored for efficiently computing state transitions. However, executing the planned actions has needed \textit{operational} models,

4: in which rich computational control structures and closed-loop online decision-making are used

5: to specify \textit{how} to perform an action in a nondeterministic  execution context, react to events and adapt to an unfolding situation.

6: \emph{Deliberative actors}, which integrate acting and planning, have typically needed to use both of these models together---which causes problems when attempting to develop the different models, verify their consistency, and smoothly interleave acting and planning.

7:

8: As an alternative, we define and implement an integrated acting and planning system in which both planning and acting use the same operational models. These rely on hierarchical task-oriented \textit{refinement methods} offering rich control structures.

9: The acting component, called Reactive Acting Engine (\RAE), is inspired by the well-known PRS system. At each decision step, \RAE can get advice from a planner for a near-optimal choice with respect to an utility function. The anytime planner uses a UCT-like  Monte Carlo Tree Search procedure, called \UPOM,

10: whose rollouts are simulations of the actor's operational models. We also present learning strategies for use with \RAE and \UPOM that acquire, from online acting experiences and/or simulated planning results, a mapping from decision contexts to method instances as well as a heuristic function to guide \PLAN.

11: We demonstrate the asymptotic convergence of \PLAN towards  optimal methods  in static domains, and show experimentally that \UPOM and the learning strategies significantly improve the acting efficiency and robustness.}

12:

13: \end{abstract}

14: