1: \begin{abstract}
2: Recent developments in multi-agent imitation learning have shown promising results for modeling the behavior of human drivers.
3: However, it is challenging to capture emergent traffic behaviors that are observed in real-world datasets.
4: Such behaviors arise due to the many local interactions between agents that are not commonly accounted for in imitation learning.
5: This paper proposes Reward Augmented Imitation Learning (RAIL), which integrates reward augmentation into the multi-agent imitation learning framework and allows the designer %of the imitation learning agent
6: to specify prior knowledge in a principled fashion.
7: We prove that convergence guarantees for the imitation learning process are preserved under the application of reward augmentation.
8: % The results are benchmarked against existing traditional imitation learning algorithms.
9: This method is validated in a driving scenario, where an entire traffic scene is controlled by driving policies learned using our proposed algorithm.
10: Further, we demonstrate improved performance in comparison to traditional imitation learning algorithms both in terms of the local actions of a single agent and the behavior of emergent properties in complex, multi-agent settings.
11:
12: \end{abstract}
13: