\begin{abstract}

Recent developments in multi-agent imitation learning have shown promising results for modeling the behavior of human drivers.

However, capturing the emergent traffic behaviors observed in real-world datasets remains challenging.

Such behaviors arise from the many local interactions between agents, which are not commonly accounted for in imitation learning.

This paper proposes Reward Augmented Imitation Learning (RAIL), which integrates reward augmentation into the multi-agent imitation learning framework and allows the designer %of the imitation learning agent
to specify prior knowledge in a principled fashion.

We prove that convergence guarantees for the imitation learning process are preserved under the application of reward augmentation.

% The results are benchmarked against existing traditional imitation learning algorithms.

The method is validated in a driving scenario in which an entire traffic scene is controlled by driving policies learned using our proposed algorithm.

Further, we demonstrate improved performance over traditional imitation learning algorithms, both in the local actions of a single agent and in the emergent properties of complex, multi-agent settings.

\end{abstract}
