1: \begin{abstract}
2: Lazy graph search algorithms are efficient at solving motion planning problems where edge evaluation is the computational bottleneck.
3: These algorithms work by lazily computing the shortest potentially feasible path, evaluating edges along that path, and repeating until a feasible path is found.
4: The order in which edges are selected is critical to minimizing the total number of edge evaluations:
5: a good edge selector chooses edges that are not only likely to be invalid, but also eliminates future paths from consideration.
6: We wish to learn such a selector by leveraging prior experience.
7: %
8: We formulate this problem as a Markov Decision Process (MDP) on the state of the search problem.
9: While solving this large MDP is generally intractable, % and approximation techniques have poor empirical convergence.
10: we show that %if the latent edge status are known,
11: we can compute oracular selectors that can solve the MDP during training.
12: With access to such oracles, we use imitation learning to find effective policies. If new search problems are sufficiently similar to problems solved during training, the learned policy will choose a good edge evaluation ordering and solve the motion planning problem quickly.
13: %The learning offers valuable insights on which features are relevant to make such selections.
14: We evaluate our algorithms on a wide range of $2$D and $7$D problems and show that the learned selector outperforms baseline commonly used heuristics.
15: \end{abstract}
16: