1: \begin{abstract}
2: Given access to accurate dynamical models, modern planning approaches
3: are effective in computing feasible and optimal plans for repetitive
4: robotic tasks. However, it is difficult to model the true dynamics of
5: the real world before execution, especially for tasks requiring
6: interactions with objects whose parameters are unknown. A recent
7: planning approach, \cmax{}, tackles this problem by adapting the
8: planner online during execution to bias the resulting plans away from
9: inaccurately modeled regions. \cmax{}, while being provably guaranteed
10: to reach the goal, requires strong assumptions on the accuracy of the
11: model used for planning and fails to improve the quality of the solution
12: over repetitions of the same task. In this paper we propose \cmaxpp{},
13: an approach that leverages real-world experience to improve the
14: quality of resulting plans over successive repetitions of a robotic
15: task.
16: % while requiring minimal assumptions on the model used for
17: % planning
18: \cmaxpp{} achieves this by integrating model-free learning
19: using acquired experience with model-based planning using the
20: potentially inaccurate model. We provide provable guarantees on the
21: completeness and asymptotic convergence of \cmaxpp{} to the optimal
22: path cost as the number of repetitions increases. \cmaxpp{} is also shown
23: to outperform baselines in simulated robotic tasks including 3D
24: mobile robot navigation where the track friction is incorrectly
25: modeled, and a 7D pick-and-place task where the mass of the object is
26: unknown leading to discrepancy between true and modeled
27: dynamics.\footnote{A blog post summarizing this work can be found at
28: \url{https://vvanirudh.github.io/blog/cmaxpp/}}
29: \end{abstract}
30: