abstract:cab369770653c2ef.tex

1: \begin{abstract}

2: Lazy graph search algorithms are efficient at solving motion planning problems where edge evaluation is the computational bottleneck.

3: These algorithms work by lazily computing the shortest potentially feasible path, evaluating edges along that path, and repeating until a feasible path is found.

4: The order in which edges are selected is critical to minimizing the total number of edge evaluations:

5: a good edge selector chooses edges that are not only likely to be invalid but that also eliminate future paths from consideration.

6: We wish to learn such a selector by leveraging prior experience.

7: %

8: We formulate this problem as a Markov Decision Process (MDP) on the state of the search problem.

9: While solving this large MDP is generally intractable, % and approximation techniques have poor empirical convergence.

10:  we show that %if the latent edge status are known,

11:  we can compute oracular selectors that can solve the MDP during training.

12: With access to such oracles, we use imitation learning to find effective policies. If new search problems are sufficiently similar to those solved during training, the learned policy chooses a good edge evaluation ordering and solves the motion planning problem quickly.

13: %The learning offers valuable insights on which features are relevant to make such selections.

14: We evaluate our algorithms on a wide range of $2$D and $7$D problems and show that the learned selector outperforms commonly used baseline heuristics.

15: \end{abstract}
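
% Illustrative sketch only: the notation below is assumed, not taken from the source.
A minimal sketch of the formulation summarized in the abstract, in notation assumed here purely for illustration (the symbols $G$, $s_t$, $a_t$, $\pi$, and $T$ are hypothetical and do not come from the source). Let $G = (V, E)$ be the graph, let the state $s_t$ record which edges have been evaluated so far together with their outcomes, and let the action $a_t \in E$ be an unevaluated edge on the current shortest potentially feasible path. Assuming a feasible path exists, that every evaluation costs one unit, and that the episode ends at the first step $T$ at which the current path is fully evaluated and valid, a selector that minimizes the number of edge evaluations could be written as
\[
  \pi^{\star} \in \arg\min_{\pi} \; \mathrm{E}_{\pi}\left[\, T \,\right],
  \qquad a_t = \pi(s_t),
\]
where the expectation would be taken over the distribution of search problems seen during training. Under this reading, an oracle with access to the true edge statuses can be queried for good actions at training time, and imitation learning fits $\pi$ to those actions.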

16: