1: \begin{abstract}
2: We propose a deep reinforcement learning (DRL) methodology for the tracking, obstacle avoidance, and formation control of nonholonomic robots.
3: By separating vision-based control into a perception module and a controller module, we can train a DRL agent without sophisticated physics or 3D modeling.
4: In addition, the modular framework avoids costly retraining of an end-to-end image-to-action neural network and makes it easier to transfer the controller to different robots.
5: First, we train a convolutional neural network (CNN) to accurately localize in an indoor setting with dynamic foreground/background.
6: Then, we design a new DRL algorithm named Momentum Policy Gradient (MPG) for continuous control tasks and prove its convergence.
7: We also show that MPG is robust in tracking varying leader movements and extends naturally to formation control problems.
8: Through reward shaping, features such as collision and obstacle avoidance can be easily integrated into a DRL controller.
9: 
10: \end{abstract}
11: