4092f723f478db85.tex
1: \begin{abstract}
2: Large-scale multi-agent pathfinding (MAPF) presents significant challenges in several areas, such as autonomous vehicle management in smart cities and warehouse robot control. As systems grow in complexity, efficient and collision-free coordination becomes paramount. Traditional algorithms often fall short in scalability, especially in intricate scenarios. Reinforcement Learning (RL) has shown potential to address the intricacies of MAPF; however, it has also been shown to struggle with scalability, demanding intricate implementation, lengthy training, and often exhibiting unstable convergence, limiting its practical application. In this paper, we introduce Heuristics-Informed Multi-Agent Pathfinding (\our{}), a novel scalable approach that employs imitation learning with heuristic guidance in a decentralized manner. We train on small-scale instances using a heuristic policy as a teacher that maps each single agent observation information to an action probability distribution. During pathfinding, we adopt several strategies to improve performance such as \textit{Preventing Re-Visit}, \textit{Treating Completed Agents as Obstacles} and \textit{Softmax Temperature Adaptation}. With a simple training scheme and implementation, \our{} demonstrates competitive results in terms of success rate and scalability in the field of imitation-learning-only MAPF, showing the potential of imitation-learning-only MAPF
3: equipped with inference techniques.
4: \end{abstract}
5: